SENSORY 

COMMUNICATION 



M.l.T. Books in the Communications Sciences 


Sensory Communication 

Edited by Waller A- Rosenbbth 

ProcEssiNC Neuroelectric Data 

By Walter A Rosenbbth and Members of the Communications Biophysics 
Group of tfie M I T Research Laboratory of Electronics 

Cybernetics or Control and Communication in the Animal and 
the Machine Second Edition 
By Norbert Wiener 

Iterative Arrays of Logical Circuits 
By Frederick C Henme III 

Transmission of Information 
By Robert M Fano 

ERrOR-CoRRECTINC CODES 
By W Wesley Peterson 

Sequential Decoding 

By John M Wozen craft and Barney ReifFen 

Style in Lancuace 

Edited by Thomas Sebeok 

Worn and Object 
By Wflkud Van Orman Quine 

Nontjnear Problems in Random Theof.y 
By Norbert Wiener 

On Human Communication A Review , a Survey, and a Criticism 
By Colin Cherry 

Language, Tiioucht, and Reality 
By Benjamin Lee Whorf 

Machine Translation of Lancuaces 
Edited by William N Locke and A. Donald Booth 


Tun Extrapolation, Interpolation, and Smoottunc 
Time Series wmi Encineerinc Applications 
By Norbert Wiener 


op Stationary 



SENSORY 

COMMUNICATION 

CONTRIBUTIONS TO THE SYMPOSIUM ON 
PRINCIPLES OF SENSORY COMMUNICATION 
July 19~ August I, 1959, Endicotl House, MJ.T. 


WALTER A. ROSENBLITH, Editor 


MLSU- CENTRAL LIBRARY 



1S376EX 


PUBLISHED JOINTLY BY 

THE M.l.T. PRESS 

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 
AND 


JOHN WILEY & SONS, INC 
NEW YORK ■ LONDON 



comicirr © ieai by 

THE MASSACHUSETTS INSTITUTE OF TECHNOLOGY 


ALL niGllTS RESERVED 

SECOND PRINTING. MAY. 1D62 ^ 

Reproduction In whole or In part is permitted for 
any purpose of the United States Government. 

The symposium reported herein was partially supported by the Air Force Office 
of Scientific Research under contract AF 49 (678) -421, monitored by the Infor- 
mation Sciences Division, Directorate of Mathematical Sciences. AFOSR 796 


library cl Conzrm Catalog Card No G 1-8795 
rrr-rrrn cs na cvcrm stjatts or amoica 




The postwar period has seen a surfeit of interdisciplinary symposia 
These symposia were a manifestation of several not unrelated facts the 
scientific universe was expanding at a rate that was hard to grasp or assess 
and the frontiers between the sciences seemed ta be moving almost as 
much as science itself Communication between disciplines had become 
more and more sketchy, and even within a given discipline communication 
had often become problematical At the same time many of the traditional 
fields among the life and behavioral sciences found themselves profoundly 
affected by the technological advances that had their origin m the physical 
sciences Experts in these technologies acquired a taste for the challenges 
of these less structured fields, but they sometimes lacked perspective 
and respect for the toughness of the problems that they proposed to 
tackle 

It was with these difficulties that the numerous symposia attempted 
to grapple Like other scientific meetings these interdisciplinary symposia 
were unequal in quality At the outset the freshness of the confrontation, 
the unorthodox approaches, promised a great deaL But as time wore on 
it became clear that while the symposia were productive of suggestive 
ideas they were no substitute for the workaday interaction of expenmenta 
bon and theory making Responsible workers in the behavioral and life 
sciences became increasingly squeamish about the one-day symposium m 
which mathematicians, physicists and engineers vented frequently the 
belief that the intelligent application of some rather elementary notions 
from mathematics and physics should yield spectacular results in the 
solution of a variety of thorny problems The brain, perception, learning 
thinking, that is to say, all topics that m Warren Weavers phrase are 
characterized by "organized complexity," were the explicit target of these 
optimistic predictions 

The biological scientists who had been trained in the more traditional 
approaches adopted a retaliatory stance m winch they were only too often 
satisfied to point to the truly appalling complexes of their experimental 
data The two parties separated, leaving behind numerous volumes that were 
represented to be high fidelity transcriptions of the tape recordings of these 
encounters Relatively few among the symposiasts got beyond the hors 
d oeuvres and stayed, as it were for dinner 

And yet these intellectual skirmishes made their impnnt upon the 
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hnd of questions that laboratory anti computer scientists lx gin to asV 
and on the results they obtained Here were at least partial payments on 
the perhaps overgcncroiis promissory notes 

Throughout the nineteenth century sensory problems had proved attractive 
to a variety of scientist* who had also worked in the physical science v-— 
young Ohm, Helmholtz. Fcchner Mach to name just a few The twentieth 
century perpetuated and modified this tradition Electronic and communion 
tions engineers concerned with the telephone and with radio television 
and photography discovered their stale in the understanding of mans 
sensory performance The engineers were largely responsible for pointing 
out how fruitful the application of theoretical concepts 1 la Wiener Shannon 
might be in an anal) sis of human information handling The advent of 
large computers and of complex weapons systems put a premium on our 
ability to couple man to these machine* that seem so capable of simulating 
and amplifying many aspects of man’s intelligent behavior 
Thus there arose albeit in a changed context, a renewed concern with 
perception Attempts to program computers to recognize certain patterns 
to translate from one language into another, and more generally to solve 
certain classes of logical problems led to an emphasis upon min's cognitive 
functions, Vo a reconsideration of the role that sensory processes play in 
this type of symbolic behavior But the formulation that was couched in 
terms of the processing of sensory information enabled the scientists of 
the post World War II period to transcend (perhaps only to bypass or even 
sidestep) the traditional dichotomy between sensation and perception 
It thus appeared worth while to bring together a group of scientists 
and engineers who had themselves done research on problems of sensory 
communication and who were willing to listen to neurophysiologists expound 
up-to-date neurophysiology, or psychophystcists talk about contemporary 
psychophysics instead of being satisfied with their own version of the 
other man’s science 

These considerations led late m 1957 to the not entirely original idea 
of a mulhdiscipbnary international symposium on principles of sensory 
communication Before too long the Office of Scientific Research of the 
V S Air Force expressed interest in sponsoring such a meeting provided 
that the collaboration of a representative group of scientists could be 
assured. In the fall of 1953 an international organizing: committee was 
formed consisting of F Bremer, T H Bullock, A Fessard R Galambos, 
W D Neff C Pfaffmann S S Stevens and W A Rosenbhth 

Thanks to the active support of this committee we were able to assemble 
a group whose broad ranging competence in the area of sensory commumca 
bon was unquestionable We regretted, however, that a variety of cucum 
stances forced a few scientists to decline our invitation and left us with a 
somewhat less "balanced representation' than we bad hoped for 

If communication on sensory communication was to take place, the 
number of symposium participants had to be limited. The number of full 
fledged participants who could be accommodated was further limited by 
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our conviction that appropriate living and dining arrangements are requisite 
tor the effective interchange of ideas We were grateful, therefore, to our 
guests (practically all of whom were from the Boston area) who were willing 
to attend single sessions only during the first week, and who nevertheless 
participated so fruitfully m the deliberations of the working groups during 
the second week We appreciated also the helpful participation of H O 
Wooster, L Butsch, H O Parrack, H Savely, and J Steele, who attended 
the meetings as representatives of the sponsor, the U S Air Force, whose 
generous, nondirective support it is a pleasure to acknowledge 

The symposium was held during the last two weeks of July 1959 at 
s Endicott House in Dedham a Boston suburb Sunday night Profes 
sor Jerome B Wiesner, Director of the Research Laboratory of Electronics 
and Chairman of the Steering Committee of its Center for Communication 
Sciences, opened the proceedings by extending the Institute’s official 
welcome Thereafter, the participants and guests at this inaugural session 
listened with evident pleasure to E G Boring Edgar Pierce Professor 
of Psychology Emeritus at Harvard University, who applied his wisdom, in- 
sight, and wit to a review of the history of afferent communication 
After this auspicious send off, the more formal sessions started on Monday 
morning and continued throughout the first week, every morning and 
every evening after dinner The afternoons were left open for laboratory 
visits,* informal discussions, walks in the park, or dips in the Endicott 
House pool The chapters in this volume correspond by and large to the 
quasi formal presentations of the first week After a week end break most 
of the participants reassembled in several working and discussion groups 
There was no attempt to record these exchanges of ideas and data on tape, 
and we can only hope that the section on comments and the final versions 
of chapters in this volume reflect some of the many cogent points that were 
raised, often in an informal manner 

Of the several pleasant social occasions that accompanied the symposium 
one deserves special mention a reception and tea at Brandegee House, 
tendered to the participants by the American Academy of Arts and Sciences 
Hudson Hoagland and Ralph Burhoe acted as hosts on behalf of the 
Academy Council and the Committee on Informal Gatherings Hoagland s 
warm words of welcome evoked a delightfully spirited response from 


W A H Rushton, FRS 

Rushton said, “It was during the American Revolution that this Academ) 
was founded, and I suppose the chief factors were the activity of great 
Boston characters and the disappearance of the King of England \ ou 
never know what circumstances will found in Academy, and wath our Ro>a! 
Society (upon which >ou said your Academy was modeled) the conditions 
were just the other way Tor with the death of Oliver Cromwell die King 

•It is a pleasure to thank E II Land, F A Webster DR C&tD Raccler. 
J Y Leitvin II B Matunma, S W Kuffler D II Hubei and T 
arranged these laboratory visits and demonstrations which were much appreciated 
by the symposium participants 
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of England reappeared and it war the Puritan! wlio went— to Boston a! I 

suppose , . , i.i 

The descendants of these Boston cmigrecs— John Adams and the rest—- 
were the men who gave you >our noble and liberal Charter, parts of which 
we should like to take as an ideal for the matter and mnnner of our meetings 
For m matter we too wish ‘to promote and encourage medical discoveries, 
mathematical disquisitions, philosophical inquiries and experiments, and 
in manner I hope that our meetings 'may tend to advance the Interest, 
honour, dignity and happiness of free, independent and virtuous people 
A given scientific gathering owes a large share of its success to a great 
number of people, many of whom are nonscientists Acknowledgment of 
this debt would gam in meaning were it accompanied b) a functional flow 
diagram of auxiliary activities How otherwise can one do justice to the 
efforts of those who struggle with the ever present inadequacies of slide 
projectors, who act as chauffeurs and guides, who perform the multifarious 
duties that make intellectual discussions at an international scale possible 9 
This meeting had its oversize and overtime share of such efforts No one 
showed more dedication m planning and operation than my assistant, Mrs 
Aunce Albert Her devotion to the success of the symposium fully merited 
the high tribute the participants paid her in the final session 

The atmosphere of the symposium owed much to the good management 
of Mrs Gertrude Wmquist, who presides over Endicott House Most of 
the participants lived in these comfortable and congenial surroundings, and 
all of them socialized there repeatedly 

The staff of the Research Laboratory of Electronics under the direction 
of R A SayeTs, provided many services with customary efficiency and 
dispatch. My colleagues from the Laboratory’s Communications Biophysics 
group unselfishly gave of their time in a variety of ways, lor example, they 
provided several demonstrations of the processing of neuroelectnc data by 
means of high-speed electronic computers My colleague. Dr Eda Berger 
Vidale, contributed in many ways to the production of this volume in 
particular, she spent untold hours in the compilation of the subject index 
Finally, I should like to express my personal gratitude to Geraldine Stone 
She has been much more than an assistant to the editor of this volume or 
an intermediary between the authors and the editor Her knowledgeable, 
persistent, sensitive and indefatigable efforts are visible on almost every 
page of this book. Miss Stone collaborated with Miss Constance D Boyd 
of the MIT Press, who contributed her measure of serious and pahent 
labors No editor could hope for more capable and dedicated associates 


Cambridge, MassachuAett * 
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S S STEVENS 

Harvard Univert ty 


I he Psychophysics 
of Sensory Function 


An inquiry into the nature of the sensory process begins properl) 
iwtfi psychophysics, the hundred year old discipline concerned with 
the responses that organisms make to the energies of the enuron 
ment We live in a restless world of energetic forces, some of which 
affect us and some of which, like radio waves, impinge upon us and 
pass unnoticed because we have no sense organs able to transduce 
them But we see lights, hear sounds, taste substances and smell 
vapors, and it is these elementary facts of psychophysics that stir our 
interest in the anatomy and ph)siology of the mechanisms that make 
sensation possible An orderly and systematic account of sensory 
communication must include a delineation of what is perceived as 
well as an explanation of how perception is accomplished In this 
sense, psychophysics defines the challenge it tells what the organism 
can do and it asks those who are inspired by such mysteries to try, 
with scalpel, electrode, and test tube, to advance our understanding 
of how such wonders arc performed 
It must be confessed at the outset that psychophysics lias often 
failed to do its part of the job with distinction Its task is not easy 
For one thing longstanding prejudices, derived in great measure 
from a chronic dualistic metaphysics, have triggered a variety of 
stubborn objections whenever it has been proposed that sensation 
may be amenable to orderly and quantitative investigation }ou 
cannot, the objectors complain, measure the inner, private, subjective 
strength of a sensation Perhaps not, in the sense the objectors have 
in mind, but in a different and very useful sense the strength of a 
sensation can, as wt shall see, be fruitfully quantified- \\ c must forgo 
arguments about the private life of the mind and ask sensible 
objective questions about tlic input-output relations of sensory tram 
ducxrs as these relations arc disclosed in the behavior of experimental 
org inisms, whether men or animals 
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Another difficulty is that psychophysics had an unfortunate cluld- 
hood. Although Plateau in the 1850 s made a half hearted attempt 
to suggest the proper form of the function relating apparent sensory 
intensity to stimulus intensity, he was shouted down by Fechner, 
who saddled the infant discipline with the erroneous “law” that bears 
his name (see Stevens, 1957b) Perhaps the hardest task before us is 
to clear the scientific bench top of the century-old dogma that sen 
sation intensity grows as the logarithm of stimulus intensity (Fechner’s 
law) The relation is not a logarithmic function at all On more than 
a score of sensory continua it has now been shown that apparent, or 
subjective, magnitude grows as a power function of stimulus intensity, 
and the exponents of the power function have been found to range 
from about 0 33 for brightness to about 3.5 for electric shock (60 cps) 
applied to the fingers There seems to exist, in other words, a simple 
and pervasive psychophysical law, a law that was once conjectured 
by Plateau and later abandoned by him, a law that is congenial not 
only to the mounting empirical evidence, but also to certain reasonable 
principles of theory construction (Luce, 1939) There will be more 
to say about the power law, but first a few words about Fechner 
The misconception began when Fechner, m 1850, espoused the 
view that error itself provides a unit of measurement He called it 
the just noticeable difference (jnd) Under most circumstances the 
jnd is a statistical concept, a measure of the dispersion or variability 
of a discriminatory response, in short, a measure of error In deriving 
his logarithmic law, Fechner made the erroneous assumption that 
error is constant all up and down the psychological scale Although 
he was willing to assume that the stimulus level error is relative, 
that is, A£ -= k<*> (Weber’s law), he assumed that at the psychological 
level equals a constant. From these two assumptions he derived 
the relation ^ ■= k log <f>, and thereby caused much mischief 
It is curious indeed that Fechner, a physicist, should have assumed 
that error, or variability of judgment, is constant all up and down 
the psychological continuum Most variables do not behave that way 
On the continua with which a physicist most often deals, error is 
usually not constant, but tends to vary with magnitude It is per- 
centage error that typically stays constant precision can generally 
be stated as one part in so many 
Suppose Fechner had taken this as his model, not only for the 
stimulus jnd A* hut also for the subjective jnd AO He then could 
have wntten 
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from which it would follow that the psychological magnitude ifr is a 
power function of the physical magnitude <£ But he fought off this 
suggestion when it was first made (by Brentano), and with Fechner’s 
temporary victory psychophysics entered upon a period of futility 
during which there seemed to be no more interesting work to do than 
measure the jnd And the logarithmic lav' became “an idol of the 
den”. 

So much for the past Smce the 1930 s psychophysics has been 
staging a comeback New interest in the age old problem of sensory 
response has been kindled by the invention of procedures for assessing 
the over all input-output operating characteristics of the intact sensory 
system These methods show that sensory response grows according 
to a power law So rarely does it happen in the study of behavior 
that a simple relation can be shown to hold under many diverse kinds 
of stimulation, that the widespread invariance of the power law be 
comes a matter of large significance 


Measurement 

The problem of the laws that govern the reactions of sentient 
organisms is intimately bound up with the problem of measurement 
Since the theory of measurement was thoroughly explored in another 
symposium (Stevens, 1959b), it need not divert us here It may be 
helpful, however, to refer to Table 1, which attempts a systematic 
classification of scales of measurement in a compact form (Stevens, 
1946a, 1951) The four scales listed, nommal, ordinal, interval, and 
ratio, are those most commonly used m the business of science and all 
of them get involved in research on sensory communication 

The nommal scale, the most general of the lot is not always thought 
of as a form of measurement, mainly because names or letters, rather 
than numbers, are most often employed to designate the categories 
or classes used in nominal scaling Yet this ubiquitous and important 
form of measurement goes on constantly, for it includes the process 
of identifying and classifying Mostly we take only a casual interest 
in such problems, but our interest has a way of turning into animated 
curiosity when it becomes a question of doing the detective work 
necessary to pm the proper labels on the functional parts of the central 
nervous system The identification of the “areas’ associated with this 
or that sensory process constitutes a lively exercise in nominal scaling 
And needless to say, much of our scientific effort in tins field never 
goes beyond the essential and basic nominal level The ordinary 
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Table 1 A Classification of Scale* of Measurement 

Measurement is the assignment of numbers to objects or events according 
to rule The rules and the resulting lands of scales are tabulated below 
The basic operations needed to create a given scale are all those listed m 
the second column down to and including the operation listed opposite the 
scale The third column gives the mathematical transformations that leave 
the scale form invariant Any number x on a scale can be replaced by 
another number x* where x' is the function of x listed in column 2 The 
fourth column lists, cumulatively downward, examples of statistics that show 
m variance under the transformations of column 3 (the mode, however, is 
invariant only for discrete variables) 


Basic Mathematical Permissible 

empirical group- statistics Typical 

Scale operations structure (mvanantive) examples 


Nominal Determma 
bon of 
equality 


Permutation group Number of cases 


where f(x) 
means any 
one to-one 
substitution 


Mode 
“Information'’ 
measures 
Contingency 
correlation 


**\ umbering" of 
football players 
Assignment of type 
or model num 
bers to classes 


Ordinal Determma 

Isotonic group 

bon of 

*W(*) 

greater 

where /(x) 

or less 

means any 


increasing 


monotomc 


function 


Median 
Percentiles 
Order correlation 
(type 0 m 
terpreted as a 
test of order) 


Hardness of 
minerals 

Grades of leather, 
lumber, wool, 
and so forth 
Intelligence- test 
raw scores 


Interval 


Ratio 


Detenu ma Linear or afHne 

bon of the group 
equality of x*— az-f-h 

intervals 6 > 0 

or of 
differ 
ences 


Mean 

Standard devia 
bon 

Order correlation 
(type I inter 
preted as r) 

Product moment 

(r) 


Temperature 
( Fahrenheit and 
Celsius) 

Position on a line 
Calendar time 
Potential energy 
Intelligence test 
“standard, 
scores” (?) 


Determma Similarity group 

bon of the X'=cr 

equality C > 0 

of ratios 


Geometric mean Length, density. 
Harmonic mean munerosity. 

Per cent variation fame intervals, 

work, and so 
forth 

Temperature 
(Kelvin) 
Loudness (sones) 
Brightness ( brills) 
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determination of thresholds which involves the categorization of 
stimuli into classes (for example seen and not seen), is another im 
portant instance of nominal scaling (Stevens, 1958) 

The hey to the nature of the four kinds of sc des lies m a powerful 
but simple principle the concept of invariance When we have 
earned out a senes of empirical operations balancings comparisons, 
ordenngs, and so on, we assign a set of numbers to reflect the out 
come of the operations This is the essence of measurement But 
what land of measurement have we achieved? That depends on the 
answer to the decisive question m what ways can the scale numbers 
be transformed without loss of empirical information? As shown in 
Table 1, each of the scales has its group of permissible transformations 

The ratio scale, the scale of greatest interest, allows only multi 
plication by a constant as when we change from inches to cenh 
meters No more general transformation is allowed If an arbitrary 
constant were to be added to the measured diameters of a set of nerve 
fibers, for example, the resulting numbers would tell us less than we 
knew before We would have lost some valuable inform«abon namely 
our knowledge of the ratios among the fiber diameters Which fiber 
is twice as thick as some given fiber would now, with the altered scale 
values, be impossible to tell In general therefore, the more re 
stneted are the admissible transformations, the more the scale is able 
to tell us 

As regards the measurement of sensabon the schema of Table 1 
suggests that our aspirabon should be to measure, where possible 
on a ratio scale This would call for assigning numbers to sensory 
magnitudes in such a way that anything more drashc than mulb 
plicabon by a constant would result in a loss of informabon Several 
variations on such a procedure have been elaborated (Stevens 1958) 
but before we consider the resulbng scales certain distmchons need 
to be made (Some nominal scaling needs to be done!) 


Sensory Qualities 

An obvious thing about sensations is that they differ in both kind 
and amount Sweet is different from sour, but both may vary from 
weak to strong The sensory qualities get named and classified 
(nominal scaling), but we try to measure the subjective intensities 
on higher order scales 

The dishncbve quality aroused by a given sensory excitation pro 
sent s a baffling problem for which no plausible explanation is yet 



6 S 5 5TCVEMS 

available Why does a sound differ Irom a taste in the wa> it docs? 
This qualitative aspect of the sensoty «orld confronts us with a 
baffling succession of discontinuous leaps as we go horn one sense 
modality to another, and no oul seems to know ssliy On the other 
hand it is at precisely tins lesel that the anatomist and the neuro- 
physiologist join the game and perform some of their most effective 
work in tracing pathways for the various modalities, and even tor 
some of the separate qualities within a modality Clearly these 
problems of topography need to be clanDed before an understanding 
of the sensory mechanisms can be anchored in the soup and sub 
stance of neural process Connections by themselves may not explain 
it all, but connections are there, and it seems improbable that they 
count for nothing 


Two Kinds of Continua 

Psychophysics progresses beyond the elementary task of naming 
sensory qualities as soon as it becomes concerned with sensations that 
appear to lie on a continuum of some sort A continuum seems 
clearly to be involved when sensations vary m strength or intensity, 
but certain other attributes of sensory response seem also to form 
continua in the ordinary sense of the term 

It would greatly simplify the mission of psychophysics if all the 
sensory continua obeyed the same rules, and did so in an invariant 
fashion It turns out, however, that a basic distinction needs to be 
made between two kinds of continua, prothetic and metathetic Loud 
ness, for example, is protheUc, pitch is metathetic An important 
difference between the psychophysical functions governing pitch and 
loudness is this the jnd for pitch represents a constant distance on 
the scale of subjective pitch, measured in mels, whereas the jnd for 
loudness represents an increasing distance on the subjective scale of 
loudness, measured in sones (Stevens and Volkmann, 1940) In other 
words, provided they are measured in subjective units, the jnd for 
pitch is constant, but the jnd for loudness grows rapidly larger as 
loudness is increased The uniformity of sensitivity or resolving 
power on the pitch continuum and the nonumformity on the loudness 
continuum entail several other functional differences between pitch 
and loudness These are discussed elsewhere (Stevens, 1957£>) 

The prothetic continua (loudness, brightness, and subjective in 
tensity in general) seem to be concerned with how much The meta 
thetic continua (pitch apparent azimuth, apparent inclination) have 
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to do with what kind or where (position) Corresponding to these 
two functional classes there seem to be two basic physiological 
mechanisms Sensory discrimination can be mediated by either of 
two processes the one additive the other substitutive (Stevens 
1946b) We detect for example an increase in loudness when 
excitation is added to excitation already present We detect a change 
in pitch when new excitation is substituted for excitation that has 
been removed Or to consider another modality we can tell when 
a light pressure changes to a strong pressure at a given point on the 
skin (addition of excitation) and we can also tell when a stimulus 
is moved from one to another location (substitution of excitation) 
Whether all perceptual conhnua that behave in the prothetic manner 
are mediated by additive physiological processes is not certain of 
course but m at least some instances it seems evident that the 
existence of two basic kinds of physiological mechanisms is reflected 
in the behavior of the psychological scales and functions which we 
construct from subjective measurements in the sensory domain 
Most of what follows is concerned with prothetic continua for they 
seem the more interesting and well behaved It should be noted 
however that die pitch continuum provides an example of a rather 
exciting attempt to match up and thereby "explain” several ps)cho 
physical functions by means of a physiological substratum Position 
of maximal excitation on the basilar membrane appears to relate m 
a straightforward linear manner to several sensory functions including 
the me! scale of suhjectne pitch the jnd and the so called critical 
bandwidth (B4hesy and Rosenblith 1931 Zwicker, Flottorp and 
Stevens 1957) 


Three Kinds of Sensory Measures 

Three separate classes of sensory scries ire distinguished (and 
sometimes confused) in psychophvstcs 

1 Discnminahlhltj scales These an. constmcted in the tradition 
of Fechner or his modem counterpart Thurelone Some measure of 
jnd variabil ty confusion or resolving poucr is emplojcd as a unit 
and a scale is constructed by counting off such units 

2 Category scales (partition scales) These are constmcted b) one 
or another varnhon on the proceduc that Plateau imented when 
he required nbseners to partition a segment of a continuum Into 
equal appearing mtcnals (Plateau had eight artists paint a gra) 
that seemed to he halfuay bctuccn bind. and svhite ) Bisection is 
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one partitioning procedure, asking a listener to assign a senes of tones 
to n equally spaced categories is another 
3 Magnitude scales These are ratio scales of apparen magnitude, 
constructed by one or another of four principal methods, of which 
“fractionation" is perhaps the best known and magnitude estimation 
the most useful (see Stevens, 19366. 19396) Under the method of 
magnitude estimation the observer simply estimates the apparent 
strength or intensity of his subjective impressions relative to a stand 



Fig 1 Three lands of psj chological measures of apparent dura bon Triangles 
mean magnitude estimations by 12 observers who judged the apparent durations 
of white noises. Circles mean category judgments by 16 observers on a scale 
from 1 to 7 The two end stimuli (0.25 and 4.0 sec) were presented at the out 
set to indicate the range and each observer twice Judged each duration on a 
7 point scale Dashed curve discnmmabflity scale obtained by counting off jntL 

ard, or modulus, set either by himself or by the experimenter The 
power functions obtained by this procedure can, and indeed should, 
be validated by direct cross modality matches — a procedure that does 
not require the observer to make numerical estimations 

An important difference between prothetic and m eta thebe con 
tmua is this on metathebc conbnua all three lands of scales tend to 
be linearly related one to another on prothebe conbnua the three 
kinds of scales are nonhnearly related (Stevens and Galanter, 1957, 
Stevens, 1959c) Typical examples of the relabons among the three 
lands of scales on prothebe conbnua are shown m Fig. 1, for apparent 
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dunhon (of a none), and m Fig 2, for apparent mtensity of vrbra 
Hon appbed to the fingertip On all protfietic continue the magnitude 
scale is a power function, the disenmimbihty (jnd) scale approxi 
mates a logarithmic function, and the category scab assumes a farm 
intermediate between (he other two Over the different sense modali 
tics, these relations among the three scales are strikingly invariant, 
they constitute one of the really stable aspects of psychophysics 
Of the tliree kinds of measures shown m Tigs 1 and 2, the one that 
seems most directly related to the over all input output function of a 



Fla 2 Three kinds of psychological measures of the apparent Intensity of a 60 
cycle vibration applied to the finger tip Procedures were essentially similar to 
those for Fig 1 For details see Stevens ( 1959d) 

sensory system is the magnitude scale The scale obtained by count 
i ng off jnd is really only that At most it tells us how resolving power 
vanes with stimulus magnitude The category scale is at best only 
an interval scale on which the zero point is arbitrary It is not a ratio 
scale But since it is nonlmearly related to the ratio scale of apparent 
magnitude, the category scale turns out, in fact, to be not even a good 
interval scale The reasons for the curvature of the category scale 
have been discussed elsewhere (Stevens and Galanter , 1957), but, 
roughly speaking it is as though the observer, when he tries to parti 
hon a continuum into equal intervals, finds himself biased by the fact 
that a given difference at the low end of the scale is more noticeable 
or impressive than the same difference at the high end of the scale 
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This asymmetry is not present on metathebc conbnua, and therefore 
the category scale is not systematically curved 


Operating Characteristics 


Sense organs serve as the transducers that convert the energies of 
the environment into neural form Like any transducer, each sense 
organ has its dynamic operating characteristic, defined by the input- 
output relation It is only recently that much attention has been paid 
to the dynamics of sensory function — the manner in. which the sensory 
system responds to variations in input intensity. Future efforts in 
this direction promise interesting rewards, however, for the form of 
the over all dynamic process is now becoming more fully understood 
Conceivably, of course, all sense organs could have the same 
operating characteristic All sensations would then grow at the same 
rate with increasing stimulus intensity That this is far from true can 
be readily verified by a simple comparison Note, for example, what 
happens when the luminance of a spot of light is doubled Then 
note what happens when a 60-cycle current passing through the 
fingers is doubled Doubling the luminance of a spot of light in a 
dark field has surprisingly little effect on its apparent brightness As 
estimated by the typical (median) observer, the apparent increase 
is only about 23 per cent. But doubling the current through the 
fingers makes the sensation of shock seem about ten times as strong 
The dynamic operating characteristics of these two sensory systems 
are clearly and dramatically different 
Closer invesUgation reveals, however, that both brightness and 
shock have a fundamental feature in common In both instances 
the psychological magnitude is related to the physical magnitude 
4* by 


The exponent n has the value 033 for brightness and 35 for shock 
The value of k depends merely on ones choice of units As will be 
shown below, the physical measure used to express <* needs to take 
account of threshold. 

The power function has the convenient feature that in log log 
coordinates it plots as a straight line whose slope is equal to the value 
, UlC Cip ? n ?. figure 3 illustrates this fact and shows how the 

!wt gr0 A ' COnttasts SVllh thc ra P*d growth of electric 

shock Also included [or compmson is the function obtained by 
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fig. 3. Sales of apparent magnitude for three prothetic conffnua plotted in 
log-log coordinates. The slope of the line corresponds to the exponent of the 
power function governing the growth of the psychological magnitude 



Fig- 4. In linear coordinates the subjective magnitude functions are concave 
upward or downward depending on whether the power function exponent is 
greater or less than 1 0 

asking observers to make magnitude estimations of the apparent length 
of various lines Here, as we should expect, the slope (exponent) of 
the function is not very different from 1 0 This is another way 
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0 £ saying that to most people a length of 100 centimeters looks about 
twice as long as a length of 50 centimeters , 

The same three functions shown in Fig 3 are plotted in linear 
coordinates in Fig 4 The function for apparent length is almost a 
straight line (exponent about 11), but electric shock grows as an 
accelerating function and brightness as a decelerating function 


Exponents 

The number of prothetic contmua on which the psychophysical 
power law has been shown to hold to at least a first-order approxi 
mation now exceeds two dozen In the authors experience, there 
appears to be no exception (Hence the temerity of calling it a 
l aw ) 

Table 2 lists the exponents of the power functions for some of the 
contmua explored thus far Although this table extends and revises 
the list presented earlier (Stevens, 1957b), it must still be regarded 
as tentative and incomplete, for there is virtually no limit to the 
number of different combinations of sense organs and stimuli that are 
waiting to be studied. 

All the exponents in Table 2 were determined by the method of 
magnitude estimation Many of them have been confirmed in other 
laboratories and by other methods, such as fractionation Many of 
them, as we shall see, have also been validated by cross modality 
intercompansons Nevertheless, it must be understood that the exact 
value of an exponent is difficult to determine with precision, and 
some of those listed m Table 2 must be regarded as a first approxi 
mation only In all cases, of course, the exponent represents an 
average value and is not necessarily appropriate to a particular mdi 
vidual At least ten observers were used to determine each of the 
exponents in Tabic 2, although some exponents (for example, hud 
ness, brightness, lifted weights) have been determined in several 
laboratories and on large numbers of observers 

The particular version of the method of magnitude estimation used 
In our most recent experiments — the one amved at after some years 
of tnat and error— is extremely simple In an experiment on loud 
ness, for example, the procedure may be as follows The experimenter 
presents a standard" sound of moderate intensity and tells the ob- 
server to consider its loudness to have the value of “10" The expen 
menter then presents in irregular order a senes of intensities above 
and below- the standard and instructs the observer to assign to each 
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stimulus a number proportion'll to the appirent loudness In other 
words, the question is if the standard is 10, whit is each of the other 
stimuli? The observer is told to use any numbers that seem appro 
pnate, fractions, decimals, or whole numbers, and to judge each 
stimulus as he hears it The standard is usually presented only at the 


Table 2. Representative Exponents of the Power Functions 
Relating Psychological Magnitude 
to Stimulus Magnitude on Prolhetic Contmua 


Continuum Exponent 


Loudness 

06 

Loudness 

0 54 

Br ghtness 

0 33 

Brightness 

05 

Lightness 

12 

Smell 

055 

Smell 

06 

Taste 

08 

Taste 

13 

Taste 

13 

Temperature 

10 

Temperature 

16 

Vibration 

0 95 

Vibration 

06 

Duration 

11 

Repetition rate 

10 

Finger span 

13 

Pressure on palm 

11 

Heaviness 

145 

Force of handgrip 

17 

Autophomc level 

11 

Electric shock 

35 


Stimulus 

conditions 

Binaural 

Monaural 

5° target — dark adapted eye 

Point source — dark adapted eye 

Reflectance of gray papers 

Coffee odor 

Heptane 

Saccharine 

Sucrose 

Salt 

Cold— on arm 

Warmth — on arm 

60 cps— on finger 

250 cps — on finger 

White noise stimulus 

Light sound, touch and shocks 

Thickness of wood blocks 

Static force on skin 

Lifted weights 

Precision hand dynamometer 

Sound pressure of vocalization 

60 cps through fingers 


beginning of the senes although a stimulus having the same intensify 
as the standard may appear as one of the stimuli to be judged along 
with the others With a senes of six to ten stimuli each stimulus is 
usually presented twice, but the order of the stimuli is made different 
for each observer In the averaging of the data from a group of 
observers it is usual to compute the geometnc means of the estimates, 
although sometimes the median provides a more representative 
measure Since the distributions of responses are usually skewed 
the arithmetic mean is seldom an appropriate statistic 
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Cross-Modahty Comparisons 

Few scientists fail to sense an uneasy concern about the foregoing 
procedure, which seems to rely merely on the observers expression 
of opinion, and winch seems also to depend on his having a moderately 
sophisticated understanding of the number system This is a proper 
concern, because naivet6 about numbers, and especially about the 
concept of proportion, certainly impedes the ability of some observers 
to perform well m this kind of experiment. The matching of numbers 
to sensation intensity is not something that a person does with fine 
precision, or that he feels great certainty about, even though the 
typical graduate student can usually manage a consistent set of 
estimates 

The interesting question, however, is not whether we are uneasy 
about the procedure, but whether the experiments on magnitude 
estimation can predict other empirical consequences that can be put 
to test In particular, can we confirm the power law without asking 
observers to make any numerical estimations at all? If so, can we 
proceed to verify the relations among the exponents listed in Table 2? 
An affirmative answer to these questions is suggested by the results 
of a method in which the observer equates the apparent strengths of 
the sensations produced in two different modalities By means of such 
cross modality matches, made at various levels of stimulus intensity, 
an “equal sensation function” can be mapped out, and its form can 
be compared with the form predicted by the magnitude scales for 
the two modalities involved 

If, given an appropriate choice of units, two modalities are governed 
by the equations 

= 

and 


$2 “ <#>2 n 

and if the subjective values ^ and y 2 are equated by cross modality 
matches at various levels, then the resulting equal sensation function 
Will have the form 

£i"* *“ 4>2 n 

In terms of logarithms 

*°g£i =* n/m log 2 

In otter words, m log log coordinate the equal sensation function 
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should be a straight line whose slope is given by the ratio of the two 
exponents 

The experimental question is whether observers can mike cross 
modality matches, and whether their matches can, in fact be pre 
dieted from the ratio scales of apparent magnitude determined 
independently by magnitude estimation The ability of observers to 
make the simple judgment of apparent equality has been well estab 
lished in other contexts Heterochromatic photometry and the 
mappmg of equal loudness contours provide two well hnown examples 
of procedures that involve the judgment of apparent equality of sensory 
intensity — a judgment made in the presence of an obvious qualitative 
difference It is but a small step to extend these procedures to cross 
modality equations As a matter of fact, some cross modality equa- 
tions seem less difficult than some equations within a single mo 
dahty 

In principle, of course, cross modality matches can be made between 
every sensory continuum and every other one Since this potential 
enterprise involves heroic numbers of experiments, only certain illus 
trative tests have been completed They are sufficient, however, to 
demonstrate the general validity of the ratio scales of subjective 
magnitude. A few of these cross modality experiments will be 
described. 


Loudness versus Vibration 

Two stimuli that are relatively easy to equate for apparent strength 
are sound and mechanical vibration The sound employed was a 
band of noise of moderately low frequency, and the vibration was 
a single frequency (60 cps) delivered to the end of the middle finger 
(Stevens, 1959a) 

The matching of the apparent intensities of sound and vibration 
was carried out in two complementary experiments fn one expert 
ment the level of the sound was adjusted to match the vibration, m 
the other the level of the vibration was adjusted to match the sound 
The sound and vibration were presented simultaneously (In many 
experiments the stimuli have been presented successively for one 
reason or another ) Each of ten observers made two adjustments at 
each level m each experiment 

The results are shown in Fig 5 The circles represent the means 
of the decibel levels to which the sound was adjusted and the squares 
represent the means of the decibel levels to which the vibration was 
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adjusted The coordinate scales are in decibels relative to the approx- 
imate thresholds of the two kinds of stimuli 
The interesting point to note is that the slope of the line in Fig 5 
is 06 which is close to the slope that is called for by the ratio of the 
exponents of the two magnitude functions It is also apparent that 
the relation is essentially linear, which is consistent with the fact that, 
over the ranges of the stimuli involved, both loudness and vibration 
ate governed by power functions 



rig 5 An equal sensation function relating 60-cycle vibration on the finger tip 
to the intensity of a band of noise The observers adjusted the loudness to match 
the vibration ( circles) and the vibration to match the loudness (squares) The 
stimulus values are measured in terms ol logarithmic scales l decibels) 


The departure of some of the points from the straight line in Fig 5 
is tn large measure due to the interesting fact that, depending on 
which stimulus is adjusted, the slope turns out to be slightly different 
The situation is analogous to the two regression lines in a correlation 
plot. This “regression'’ or “centering tendency” is common, if not 
universal, in matching procedures, and it points up the desirability' 
of a balanced design in which each stimulus is made to serve as both 
the standard and the variable (see Stevens, 1953b) The matching 
of loudness and vibration turned out to be surprisingly easy Some 
of the observers, who happened to ha\e served m loudness matching 
experiments expressed the opinion that matching loudness to vibra 
tion seemed easier than matching the loudnesses of two tones of 
widely different pitch or quality <c£ Stevens, 1936a) The con 
sistency of the judgments seemed to bear this out 
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two of the many experiments 
mme ratio scales of subjective 


that have been performed to deter- 
magmtude for loudness and bright- 


ness 



rig. 6. Results of adjusting a loudness ratio to match an apparent brightness 
ratio defined by a pair of luminous aides. One of the cueles was made dimmer 
than the other by the amount shown on the abscissa. The observer produced 
white noises by pressing one or the other of two keys, and be adjusted the level 
of one noise (ordinate) to make the loudness ratio seem equal to the brightness 
ratio The brighter light was about 99 db re 10 -10 lambert, and the louder 
noise was about 92 db re 0 0002 dyne per square centimeter 


It should be pointed out that the use of decibel scales simplifies the 
stimulus specification for vision and audition, and facilitates com- 
parisons between their sensory dynamics The foregoing examples 
demonstrate some of these advantages Although the application of 
decibel measures to visual stimuli is not yet common practice, there 
is much to recommend it (see S S Stevens, 1955c). As a matter of 
fact, except for the rigidities of professional custom, the application 
of the decibel notation to the measurement of light presents less diffi- 
culty than its application to sound, because the decibel is defined in 
terms of energy flow 

iV db = 10 log 

£.0 

and it is only by a kind of bastardized extension that the decibel gets 
used with measures of sound pressure. The energy m a sound wave 





(Circles decibels re 0 0002 m crobar) 

(Squares, decibe Is re JO“ 10 lambert) 

7. Medina magnitude estimations lor loudness and brightness. 
or the loudness of a 1000-cycle tone, each of 32 observers made 2 estimates 
ach level Since no standard modulus was designated, each observer chose 
own, and the resulting numerical estimates were transformed to a common 
ulus at the 80 db level 

or brightness each of 28 dark adapted observers made 2 estimates of each 
ulus level The target subtended an angle of about 5 degrees and was illu 
tted for about 3 sec Once at the beginning of each session the observer was 
n a Stimulus of 70 db (14 observers) or 80 db (14 observers) and told to 
it ‘TO ” The estimates were transformed to a common modulus at 70 db 

roportional to the square of the sound pressure, but only under 
speaal conditions With light, on the other hand, we are con 
ed only with energy measures, relative or absolute, and there is 
eed to become entangled m measures that are nonhnearly related 
nergy. 


rce of Handgrip 

ke any other sensation, the subjective impression of muscle ten- 
can be measured on a ratio scale of psychological magnitude 
queezing a precision dynamometer (Fig 8), an observer can 
Lice a sensation of apparent force and at the same lime activate 
I that indicates the actual force exerted Two pertinent questions 
themselves (1) How does the feeling of apparent force relate 
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mdicator of other subjective magnitudes has led to even more exciting 
results Instead of asking observers to emit numbers m response to 
stimuli, we can ask them to emit squeezes of appropriate sizes In 
tins manner, observers have matched apparent force to apparent 
sensory intensity on nine different conhnua and have produced the 



fig lo. Functions for apparent force of handgrip obtained by the method of 
magnitude estimation. Each curve is for a single observer, and the position of 
the curve on the abscissa is arbitrary (a) Medians of 6 estimates by each 
observer using the dynamometer shown in Fig. 8 The forces estimated were 
4, 10, IS, 22, SO, and 40 pounds, (b) Medians of 10 estimates by each observer 
using the more compliant dynamometer The forces estimated were 5, 11, 17, 
23, 29, and 35 pounds. 

results shown in Fig. 11 (] C Stevens, Mack, and S S Stevens, 1960, 
J C Stevens and S S Stevens, I960) 

Two points are immediately evident. All the data in Fig. 11 

approximate power functions — straight lines in log log coordinates- - 

and the slopes stand m the same order as the values of the exponents 
listed m Tahlc 2. Less obvious but even more interesting is the exact 
numerical relation between the slopes determined by matching with 
handgrip and those determined by matching with numbers (that is 
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ffg II. Equal sensation functions obtained by matching force of handgrip to 
various criterion stimuli Each point stands for the median force exerted by 10 
or more observers to match the apparent intensity of a criterion stimulus The 
relative position of a function along the abscissa is arbitrary The dashed line 
shows a slope of 1 0 in these coordinates. 


Table 3. The Exponents (Slopes) of Equal-Sensallon Functions, 
as Predicted from Ratio Scales of Selective Magnitude, 
and as Obtained by Matching with Force of Handgrip 


Ratio scale Scaling by means of handgrip 

Exponent of Stimulus Predicted Obtained 

Continuum power function range exponent exponent 


Electric shock 
(60-cycle 

35 

current) 

Temperature 

(warm) 
Heaviness of 

16 

Jilted weights 
Pressure on 

145 

palm 

Temperature 

11 

(cold) 

60-cyclo vibra- 

10 

tion 

Loudness of 

095 

white noise 
Loudness of 
1000-cycle 

06 

tone 

Brightness of 

06 

white light 

033 


0 29-0 72 milliampere 
2.0-14-5*C above 
neutral temperature 

28-480 grams 

0 5-5 0 pounds 
3 3-30 6*C below 
neutral temperature 
17-47 db re appron 
mate threshold 

55- 95 db re 00003 
dyne/cm 2 

47-87 db re 00002 
dyne /cm 2 

56- 98 db re 10" 10 

1 amber t 


2 06 2.13 

0 94 0 98 

085 079 

065 067 

0 59 0 60 

0 56 0.58 

035 0 41 

0 35 0.35 

0.20 0.21 
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to say, magnitude estimation) Since the exponent for handgrip is 
approximately 17, we should expect that the exponent for a given 
continuum in Table 2 would be about 1 7 tunes as large as the slope 
of the corresponding hne in Fig 11 How nearly this expectation is 
fulfilled is shown by the comparisons m Table 3 Despite the varia- 
bility inherent in experiments of this sort, the agreement between 
the obtained and predicted exponent is generally satisfactory This 
agreement testifies with a certain eloquence to the basic validity of 
the ratio scales of sensory magnitude 


The Stimulus Scale 

Except for the two continua, warm and cold, the physical stimuli of 
all the continua discussed above have been measured on the ordinary 
physical scales of amperes, grams, dynes, and so on This practice 
is sufficiently accurate for most purposes, but when we look more 
closely we see that the general form of the power law is 

< I' “* *($ <*>o) n 

where <}> Q is a constant value corresponding to “threshold” For 
ranges of stimuli well above the minimum detectable level, the value 
of <£o is usually negligible, but it assumes larger proportions when 
subjective scales are extended downward toward very low values 
Temperature provides a clear and dramatic example of the 1 m 
portance of measuring stimuli m terms of the ratio scale of distance 
from threshold (J C Stevens and S S Stevens, 1960) The threshold 
for warmth when aluminum stimulators are applied to the inside of 
the forearm is about 305 7° above zero on the absolute scale (Kelvin) 
Compared to the short range of tolerable thermal stimuli, thic is indeed 
a high threshold 

As shown in Fig. 12 (log log plots), when apparent temperature 
is scaled by magnitude estimation and the results plotted ag ains t the 
Kelvin scale the data fall on a curve that is sharply concave down 
ward When plotted w terms of degrees above the neutral or 
threshold value, however, the data fit a power function with an ex 
ponent of about 1 6 From these measurements it follows that the 
power function formula for subjective warmth {>*, js 

V--fc(T, -3057)i o 
where T* is absolute temperature. 
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In a similar type of experiment {aluminum stimulators applied to 
toe arm), the formula for cold turned out to be 

i h- (3042 — Tjt) 1 0 

The difference between the two values, 305 7 and 304 2, corre 
s ponding to <j> 0 is of no particular significance It presumably means 
that the observers’ average skin temperature was different in the two 
experiments On most other continua the value of the additive 
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fig 12 Magnitude estimation of apparent warmth Each point is the geometric 
mean of 36 estimates (12 observers) The upper abscissa for the filled points, 
M a log scale of the difference m temperature (Celsius) between the stimulus 
ind the “physiological zero “ The lower abscissa, for the unfilled points is a log 
scale of the absolute temperature (Kelvin) 

oaostsot Jo & sms}} relative £&? vsabie stumdus range Neveiihe 
less, m two instances a revision of the stimulus scale designed to take 
explicit account of <p Q has transformed otherwise wayward data into 
well behaved power functions The scale for tactile vibration 
(60 cps) applied to the arm was corrected in this manner (Stevens, 
1959d), and a similar treatment was applied to the loudness scale by 
Scharf and J C Stevens (in press) As Luce (1959) Ins pointed 
out, the use of an additive constant to bring the zero of the physical 
scale into coincidence with the zero of the psychological scale is a 
proper generalization of the power function Jaw (Differences on a 
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ratio scale constitute a ratio scale, as do also differences on an interval 
scale ) 

In calling fa the “threshold" value, we raise a problem concerning 
precisely what is meant by the term threshold. It is not necessarily 
the threshold as measured in some arbitrary manner under arbitrary 
conditions Rather it should probably be thought of as the “effective'* 
threshold that obtains at the time and under the conditions of the 
experiment in which the magnitude scale is determined Needless 
to say, this “effective” threshold cannot be measured very precisely 
Consequently, it becomes expedient to take as the value of <£o the 
constant value whose subtraction from the stimulus values succeeds 
in rectifying the log log plot of the magnitude function Provided 
the constant value so chosen is a reasonable threshold value, this 
procedure seems justified At any rate, it has worked well for the four 
continua, vibration, loudness, warmth, and cold 


Variability 

Needless to say, the responses people make to sensory intensity are 
variable Although an exemplary picture of this variability is shown 
in Figs 9 and 10, a further statement is in order about it The state 
meat will be bnef, however, because the author confesses to a certain 
lack of enthusiasm for elaborate statistical analyses 

By and large, the interquartile range encountered when groups of 
observers undertake magnitude estimation on mtensitive continua is 
of the order of 0 2 to 0 3 log unit (It may, of course, be lower for 
continua that are easy to judge ) This variability contains certain 
obvious components, however, the most important of which appear 
to be the following. 

1 Variability due to the observer’s modulus , that is, his conception 
of the standard” Since we me concerned only with the form of the 
magnitude scale, this source of variability is of no concern When 
desired, it can be partialed out m one way or another, with a conse 
quent reduction In the over all variability 
This component of variability is especially evident in those expen 
ments in which each observer is allowed to choose his own modulus 
(Stevens, 1956b) It also plays a prominent role in cross modality 
matches (Stevens 1959a, J C Stevens, Mack, and S S Stevens, 
1960) Each observer for example, has his own conception of what 
force of handgrip matches what level of loudness, but the absolute 
values chosen by the observer are irrelevant so far as the form oE the 
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equal sensation function is concerned It is only the relative values 
that matter Concretely, our mam concern is with the slopes and not 
the intercepts of the functions in Figs 9 and 10 

2 Variability due to the observers conception of a subjective iatio 
In a method like fractionation or magnitude estimation, each person 
must make up his own mind about what he considers “half as bright,” 
say, and not all observers arrive at the same conclusion (A plot 
showing an example of the variability encountered in halving and 
doubling m ly be found in Stevens, 1957a ) Nothing much can be 
done about this source of variability, except perhaps to try to avoid 
biases and constraints in the conditions of observation (see Stevens 
1956b) 

3 Variability due to differing sense organ operating characteristics 
Tins source of variability is biologically the most interesting, but it 
is probably of only minor magnitude m a group of “normal" observers 
Nevertheless, m the hard of hearing ear or the night blind eye, it may 
be a factor of considerable consequence (The state of a sense organ 
like the eye is also changed, of course, m the process of adaptation ) 
In order to evaluate this "sense organ" factor, it may prove useful to 
apply a battery of cross modality matching tests A sufficient battery 
of these tests should prove capable, for example, of distmgmslung 
the person with auditory recruitment from tire person whose con 
ception of a subjective loudness ratio is merely atypical 

An interesting problem related to variability concerns the question 
of departures from the power function law Can we expect that the 
power law will always hold rigorously (provided, of course, no errors 
arise in our measurements), or should we look for second order 
deviations from it? The data of particular experiments sometimes 
depart from the power law, but in most instances it is not easy to 
determine whether these defections are due to artificial biases of one 
land or another Nevertheless, since the possibility of genume 
departures from the power /aw is a problem of basic moment an 
effort should be made to devise procedures of sufficient accuracy to 
settle the question The fact that the power law is closely approxi 
mated by so many data in so many different sense modalities adds 
interest and significance to any authentic departures from the power 
function form 

The Rofe of Transducers 

The foregoing suggestion of a method for determining the individual 
loudness function in a hard of hearing ear assumes that the nature 
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of the sensory transducer largely determines the form of a magnitude 
function. An opposite assumption has often been made, however, 
to the effect that the magnitude function merely reflects how ob- 
servers have learned in the past to associate sensory impressions with 
some known aspect of the physical stimulus. The beaming” explana- 
tion has r ecen tly been revived (Warren, 1953), and, under the name 
“physical correlate theory,” it is alleged to provide a "basis for Stevens 
empirical law” (Warren, Sersen, and Pores, 1958). If this theory were 
correct, it would presumably explain why the psychophysical law is 
a power function, but the evidence that learning accounts for all the 
exponents in Table 2 is mostly nonexistent. Familiarity with the 
stimulus may be a factor in people’s judgments on some kinds of 
contmua (although how one would prove it is hard to see), but many 
of the contmua m Table 2 are quite unfamiliar to the typical ob- 
server — at least as regards measures of stimulus intensity And 
especially difficult to conceive is how familiarity with the physical 
stimuli— even if the observers had such familiarity — could account 
for the results of cross modality matching like those shown m Fig. li- 
lt seems rather more probable that the exponents are what they 
are because of the nature of the sensory transducers. It is likely, 
for example, that the exponents for light and sound are smaller than 
10 because these sensory transducers behave essentially as “com- 
pressors” — a characteristic that enables them to handle the enormous 
dynamic ranges of stimulation to which they are subjected. At the 
other extreme, in the transduction process involved with electric 
current applied to the fingers, there is an operation of “expansion” 
m the sense that the psychological magnitude grows as an accelerating 
function of stimulus intensity, that is, the exponent is greater thnn 1.0 
It seems quite improbable that the form of this function was “learned” 
by the observer. 


It is an interesting question whether electrical stimulation of nerves 
other than those m the fingers would also exhibit an accelerating trans- 
duction characteristic. In a study of the “electrophomc effect” (Jones, 
Stevens, and Lune, 1940), patients lac k i n g tympanic membranes were 
stimulated by means of an electrode placed inside the middle-ear 
cavity although some patients heard pure tones, seven of the group 
heard only a buzzing noise whose quality was more-or less inde- 
pendent of the frequency of the stimulating current Since it is 
certain that other nerves (for example, the facial and the vestibular) 
were occasionally stimulated m the course of these experiments it 
«*ms safe to conclude that the auditory nerve was also sometimes 
ducctb affected by the current Direct, unpattemed stimulation of 
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tlie auditory nerve fibers would account for the patients hearing 
only a noise ° 

Some of tlie patients noted a large change in loudness when only 
a small change was made m die stimulating current This effect was 
so striking that an attempt was made to measure the loudness change 
by comparing it with a sound in the opposite (normal) ear The 
outcome is shown in Fig 13 (for further explanation, see Stevens, 
Carton, and Shickman, X9o8 ) Apparently if the auditory end organ 



Fig 13 Showing the steep growth of loudness with increasing electric current 
applied to the auditory nerve of a patient whose eardrum had been removed 
The current was delivered by an electrode placed in the middle-ear cavity The 
exponent is about four times as large as the exponent obtained with acoustic 
stimulation 

is bypassed, and if a stimulating current acts directly on the eighth 
nerve, a new transducer process becomes involved, and the function 
describing the growth of loudness acquires a radically different ex 
ponent The implication is that the compression” observed when 
the normal ear is stimulated by sound waves is a function of the sense 
organ, not of some higher center in the nervous system 
When it is discovered that two conhnua presumed to be rather 
similar are governed by different exponents (Tabic 2), one suspects 
that there may be basic differences in the transducer systems involved 
Two pans of such conhnua are especially interesting warmth and 
cold and taste and smell. Warmth and cold are interesting because 
the same stimulating device, applied to die same place on die arm 
produces two scales of sensory intensity, one for stimulus tempera 
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hires below neutral and one for temperatures above neutral (J C 
Stevens and S S Stevens, 1960) Not only do warmth and cold 
produce sensations o£ different quality, they also appear to do it 
by means of transducers with different operating characteristics The 
temperature sense is also unique in that the neutral or threshold point 
is the bottom of one stimulus scale (warmth) and the top of another 
(cold) Cold increases as the stimulus value decreases 
Taste and smell are often classed together as chemical senses The 
mode of action of the stimulus for smell has been such a mystery, 
however, that mechanisms other than the bathing of end organs by 
chemical solutions have been hypothesized from time to time This 
state of affairs gives added significance to the obvious difference 
between the operating characteristics of the olfactory and gustatory 
systems Tor the several substances thus far tested, the exponents 
of the power functions for olfactory intensity have run from about 
On to 0 6 (Jones, 1958, Reese and Stevens, 1960) Earlier expen 
ments on taste, with the method of fractionation, gave exponents of 
the order of L0 (Beebe Center and Waddell, 1948) The exponents 
for taste listed in Table 2 were obtained by Mary McLean in some 
exploratory experiments with the method of magnitude estimation. 
This work is still m progress, but there is little doubt that the expo- 
nents for taste are generally about twice as large as the exponents 
for smelL Does this difference in the dynamics of apparent intensity 
mean that two wholly different mechanisms underly the transduction 
processes in taste and smell? 


Summary 


A central issue in the struggle to transform psychology into a 
science has long concerned the quantification of sensory magm 
tudes Fechners creation of ps> chophysics was a monumental at 
tempt to measure sensation, but the model as well as the method 
by which he hoped to achieve its quantification has turned out to 
be a vigorous stride in a wrong direction. Other models envisaging 
other methods, have lately achieved success in the mapping of quan 
titahve relations on sensory con tin ua. 

The measurement of sensation in a manner useful to science has 
followed upon the clarification of several sticky issues that formerly 
thwarted the proper investigation of quantitative relations Progress 
has been made on the following points 
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1 Sensation has ceased to be regarded as a private mentalistic 
enhty comprising one lialf of a metaphysical dualism, and has joined 
the class of other natural constructs, defined m terms of the opera 
tions used to denote and measure it We study sensation by noting 
the behavior of organisms under the impact of energetic configura- 
tions (stimuli) 

2. The concept of measurement has been broadened to include 
much more than the operation of counting, which was all there was 
to primitive measurement Measurement is now conceived as the 
assignment of numbers to objects or events according to rule The 
distinguishable rules, and the invariances consistent therewith, entail 
four classes of scales nominal , ordinal, interval, and ratio A fifth 
class, called logarithmic interval, is of some academic but little prac 
tical interest 

3 A basic distinction can be made between two kinds of sensory 
contmua, prothebc and metathetic The distinction rests on a set 
of functional criteria, but it is useful to think of the pro diet ic contmua 
as those concerned with how much (intensity) and the metathetic 
contmua as those concerned with what or where (quality) At the 
physiological level, many prothetic contmua (loudness, brightness, 
and so forth) involve the addition of excitation to excitation, whereas 
some of the metathetic contmua (pitch, apparent azimuth, inclination) 
involve the substitution of excitation for excitation 

4 Ratio scales of sensation, permitting a direct determination of 
the input output characteristics of sensory systems, have been erected 
on prothetic contmua by means of a variety of simple operations The 
principal methods have involved (a) direct numerical estimations by 
the observer and ( b ) cross modality comparisons in which the appar 
ent strength of a sensation m one modality is made equal to a given 
sensation in another modality 

The outcome of these developments is a simple and attractive power 
law relating psychological magnitude to physical stimulus 
The formula is 

V' “ W<f> — #o)" 

where <f> 0 is the effective threshold value On some two dozen eon 
tinua the exponent n has been found to range from 0 33 for visual 
brightness to 3 5 for the subjective strength of electnc current applied 
to the fingers The generality and utility of tins quantitative law 
may open new vistas in psychophysics and provide new insights into 
the roles played by the sensory transducers 
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Some Aspects 
of Psychophysical Research 


A Note on Measurement 

Although the problems studied m psychophysics are rather varied, 
one thing is central to all the work subsumed under this name That 
is the measurement of subjective variables In the present stage of 
psychophysical development vve are concerned in particular with 
quantifying human experience on a relatively simple level, for example, 
measuring subjective brightness or pitch or perceived shin pressure 
This is not necessarily the whole thing, but it is fairly true for the 
present situation 

The measurement technique, which is used m most of this work, 
has been developed mainly by S S Stevens and his co workers over 
a number of years Without going into any detail, I shall briefly 
characterize this sort of scaling I call it direct scaling of subjective 
variables, because the essential steps of the scaling procedure are 
implied m the experimental situation You may present a tone to the 
subject and tell him that the loudness of this standard tone is called 
100 Now, in relation to this, what is the loudness of another tone 
of equal pitch? You may obtain the estimate 62, and this datum 
is one observation of the scale value of the second tone The two 
scale values are, by definition, on a ratio scale the ratio 62/100 should, 
according to the instructions, be equal to the subjective ratio of the 
second loudness to the loudness of the standard Of course, before 
the final scale can be constructed, you have to collect many observa- 
tions and do some computational tricks with them, but fundamentally 
the situation is about as simple as outlined here 
Even if this procedure seems to be quite straightforward, one might 
still be in some doubt about the metric properties of the resulting 
scale In an investigation carried out by Goude (1959) m this lab- 
35 
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oratory, some fundamental properties of seales obtained by dircet 
scaling methods were studied Results from two of Couchs experi 
ments are shown m Fig 1 These experiments were designed to test 
the additivity of scale values In one part of the experiments (empty 
circles) the subjects were comparing single stimuli two at a time, 
as is commonly done in scaling In another part of the experiments 
(filled circles) the subjects were working with subjective sums of the 



rig 1 Results from an experiment on weight lifting (a) and another experiment 
on subjective magnitude of angles (b), earned out in order to test the additivity 
of scales constructed by direct ratio methods (Goude 1959) Unfilled circles 
represent scale values for single stimuli Filled circles represent scale values 
obtained when the subjects were working with subjective sums of subjective mag 
m hides In each experiment both sets of data exhibit the same trend that Is 
closely approximated by the power functions represented by the curves 

subjective magnitudes under consideration. The two sets of data thus 
obtained m each experiment clearly exhibit the same general trend 
and may well be approximated by the same curves, as shown in the 
graphs This is probably the first time that the fundamental metric 
property of additivity has been demonstrated in scales obtained by 
direct psychophysical methods of measurement Results like these 
should increase our confidence m these methods 

Three Types of Problems 

Stimulus-response relations 

It was Fechners idea to investigate the relation between a subjec 
bve intensity and the intensity of the physical stimulation evoking 
the subjective response On the basis of Weber s law (which usually 
holds over a wide range) and the assumption that just noticeable 
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stimulus differences are subjectively equal (which m general they are 
definitely not, as we now know), he derived his famous logarithmic 
law which has ever since continued to appear in textbooks without 
ever being verified 

In recent years, however, the power function has been found to 
describe the psychophysical relation very adequately for about twenty 
contmua so far investigated Convincing evidence has been prc 
sented, especially by Stevens (1957) and Stevens and Galanter (1957) 
There is hardly a really convincing exception known to this rule We 
usually write the power function in the form 

R = c(S + a)" (1) 

We have already seen two power functions fitted to the data in 
Figs In and b As a further illustration of this psychophysical hw, 

I have chosen an experiment from our laboratory concerning the sub 
jective brightness of monochromatic light of various wave lengths 
(Ekman, Eisler, and Kunnapas, I960) In order to avoid confusion, 
the data have been divided into two graphs of Fig 2. The curves 



fig Brightness as a function of stimulus Intensity for rjght wave Icngthl 
Thu circles and other symbols represent the scale values with a common urnt of 
measurement to order to avoid confusion between nearly coinciding curves. the 
data are plotted mid the eutves drawn In two separate graphs The curve. repre- 
sent power functions For the various wave lengths the follow uig functions are 
obtained 


R 4-a - -l93(S-0 6S)o«« 
R,„- 62O(S-0GO)O-3« 
0 - 55)0 

IU, - 996(S- 0 50)oo-« 
Rjm- 9 71(S-052)o-»« 
- 10 02(S - 0.51)0 *« 
JV 49 - 330(5 
ficn- 1-21(5 - a»S)«« 
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represent power functions fitted to llie data, and there is a rather good 
agreement between these theoretical curtLS and the experimental 
data, which represent brightness values in terms of a ratio scale con- 
structed by one of the direct methods mentioned above (magnitude 
estimation). . 

The parameter descriptive of the curvature of the function is the 
exponent This is about one third for all wave lengtlis, which is about 
the same order of magnitude as the exponent for white light The 
parameter c describes the relative brightness of a light of constant 
intensity and varying wave length And finally, the parameter a can 
be considered to be related to the absolute threshold, although this 
relation may be complex (Ekman, 1959) 

Psych ophyslologlcal relations 

By far the greater part of modem ps>chophysical work has been 
devoted to the classical problem of stimulus response relations The 
mam outcome has been the power law with its rather remarkable 
generality This law is empirical and descriptive, and per sc it does 
not tell us anything about the mechanisms responsible for the trans- 
formation it describes 

In general, these mechanisms will have to be investigated on the 
psychophysiological level, but very bttle has been done so far It is 
well known that peripheral response processes often bear a logarith- 
mic relation to stimulation Since the final subjective outcome is a 
power function of stimulation, we have to expect that the central 
process generating the subjective response is an anUlogarithmic func- 
tion of the peripheral process No such function has even been 
plotted, which I think should be a challenge to research workers in 
the neighboring fields of neurophysiology and psychophysics 
However, we have some information about differential sensitivity 
which will have to be incorporated, in any theory of the psychophysio- 
logical transformations of a stimulus intensity into a subjective mag 
mtude such as brightness or heaviness or pain. This information may 
be briefly summarized as follows 

1 Differential sensitivity, as measured on the stimulus continuum, 
is usually adequately described by the modified form of Weber’s law 

AS — k,S + a (2) 

This has long been known and needs no particular illustration 
here 

2. Differential sensitivity, as measured on the subjective continuum. 



SOM£ ASPECTS OF PSYCHOPHYSICAL RESEARCH 39 

seems to follow tile same principle 

AR =* k R K -j- b (gj 

This is less well known and deserves a few comments It was prob- 
ably first pointed out by Harper and Stevens ( 1918) that the subjective 
counterpart of a just noticeable difference tends to increase with S 
In some recent studies from our laboratory we have tried to measure 
the just noticeable difference in terms of R, and Fig 3 shows the 
results for three sets of data The graphs verify Eq 3, they also 
suggest that the constant 6 may be negligible, but that question re 
quires further investigation 



tig 3. Just noticeable differences in subjective units AR, plotted against subjective 
magnitude R (a) values computed from data for lifted weights in two experi- 
ments by Oberlin (1930) transformed to a subjective scale, ( b ) results from an 
experiment on visually perceived velocity, and (c) results from an experiment on 
aurally perceived tune (Reproduced from E Jan an, 1959 ) 


Let us for the moment accept both of the propositions stated above 
(Eqs 2 and 3) and see what we can do with them I think that the 
empirical information we now have may appear a little more intelli- 
gible if we look at it from the following points of view 

1 The discriminative response appears to be essentially a periph- 
eral process This would follow from the modification of Webers 
law m Eq 2 and the demonstration that peripheral response is a loga- 
rithmic function of stimulus intensity When considered together, 
these relations mean that a just noticeable difference on the stimulus 
continuum has a counterpart in a constant increment on the continuum 
of peripheral response In other words, the derivation of Technor's 
law would be valid at the level of peripheral response 

2 The constant increment of peripheral response, however, corre- 
sponds to a subjective increment that is linearly related to the total 
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subjective magnitude This indicates a multiplicative transformation 
of peripheral response into subjective response, and this is precisely 
the anhlogantlimic transformation we need to come out with the 
power function relating subjective magnitude to stimulus intensity 

Perhaps we may tentatively conclude that discrimination and per- 
ception of subjective magnitude are psychophysiologically quite dif- 
ferent processes, the latter being the final outcome of a step by step 
process of transformations 

Inirasubjecttve relations 

Finally, I shall illustrate a type of problem that is more purely 
psychological than the two problems we have considered so far Untd 
now we have been investigating psychological variables as functions 
of physical or physiological variables In the present context we are 
going to study relations between psychological variables 

One such problem which has been studied for some time in our labo 
ratory is concerned with the experience of similarity What function 
of their subjective attributes is subjective smulanty between two 
stimuli? So far we have investigated this problem only for umdimen- 
sional continua, that is to say, we have allowed the two stimuli to 
vaxy only in one subjective dimension I shall illustrate this work by 
our first experiment, in which we studied the similarity of pure tones 
of varying pitch and equal loudness In one part of the experiment 
a pitch scale was constructed and in the second part of the expen 
ment a scale of subjective similarity was obtained. It was now pos 
siblc to investigate smulanty as a function of pitch ( Eisler and Ekman 
1959) The result may be expressed by the equation 

“ ( R , + /!,)/2 • S 

where s tJ vanes from 0 (complete absence of similarity) to 1 (iden 
tity) Data from this experiment on pitch are shown m Fig 4, where 
the experimental measures of smulanty are plotted against the theo- 
retical values of similanty computed by Eq 4 from the expen mental 
measures of pitch. In subsequent experiments the same equation 
has been shown to describe the mechanism of similanty also in the 
continua of bnghtness (the subjective continuum was defined as 
darkness) and visual area (Ekman, Goude, and Waern, 1961) as 
well as heaviness (Eisler 1960) 

Since Eq 4 has been verified in four quite different perceptual con 
tinua it appears to possess a certain degree of generality It is one 
of the lew Stnctly quantitative pnnciples or "laws” known to desenbe 
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a psychological mechanism An outstanding feature of Eq 4 is the 
absence of empirical constants One thing I want to stress here is that 
the simplicity of this relation could be discovered only because it was 
investigated exclusively within the subjective domain Suppose that 
we had tried to investigate subjective similarity m terms of tonal 
frequency, other stimulus variables being equal At best we could 
have arrived at a rather complex empirical equation which it would 



fig 4 Experimental estimates of similarity plotted against theoretical values 
computed according to Eq 4 from a ratio scale of pitch. The data exhibit only 
a random fluctuation around the diagonal line representing perfect agreement 
Similar results have been obtained for brightness, visual area, and heaviness. 
{ Reproduced from Eisler and Ekman, 1959 ) 

not have been possible to "understand" immediately, because pitch 
is a nonlinear function of frequency and because loudness would have 
been a second subjective variable in the experiment I think that 
these considerations are important, because they seem to indicate 
the possibility of discovering psychological mechanisms that may 
turn out to be rather simple when they are studied on the proper level 
Let me add a few concluding comments on the similarity principle. 
My first guess in planning these experiments was that similarity might 
be a simple ratio between the two scale values (s tJ - Rt/R h where 
Rt^R,) When the data were plotted, we immediately had to reject 
this hypothesis, and then Dr Ersler had a hard time fitting various 
functions until he found one that yielded acceptable results It was 
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first written as 198Hy(fl, + Ri) We were fortunate in that 19S 
is pretty close to 2, and wc could write Eq 4 without empirical con 
slants, letting the factor 2 represent an averaging operation 
Written m this form, Eq 4 tells us that the subjective similarity 
between two percepts varying in one subjective dimension is the 
ratio between the lower scale 
value and the arithmetic mean 
of the two scale values This m 
terpretabon is represented in the 
left hand part of Fig 5 It is 
certainly simple and, m a sense, 
intelligible But perhaps we can 
“understand” the perceptual 
process even better by writing 
the same equation m the form 


A 


i 


fig 5 Illustrations of the similarity 
principle expressed by Eq 4- The left 
hand part of the graph represents Eq 4 
and the right hand part the alternative 
form, Eq 4a. For explanations, see text 
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2 R. 
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R. + R, 

which is represented in the 
right hand part of Fig. 5 Let us 
see how it can be interpreted 
There are two different percepts 
of intensities R» and R/, both present simultaneously as they are m the 
experimental situahon. The similarity generating process does two 
things with this material (1) it counts off the part of R/ that cor- 
responds to Ri (shadowed areas), and (2) it computes the ratio be 
tween this common subjective magnitude 2Ri and the total subjective 
magnitude Ri + Rj present in the experimental situahon 

Mathematically the two forms of Eq 4 are idenheal, but person 
ally I now prefer the latter form, since it implies a more concrete 
model of the operation performed in the similarity generating process 
of a human nervous system What this operation would look like 
from the point of view of a brain ph>siologist I do not know, but it 
would certainly be interesting to know 


Extensions of Psychophysics 

All the work I have discussed, in this paper has been earned out by 
means of those direct ps>chophysical methods that I mentioned m the 
first section. The introduction of these methods by Stevens and his 
co-workers initiated the rapid development of modem psychophysics 
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These methods can be used only m experimentation with human 
subjects, since they require the subject to estimate relations between 
subjective magnitudes This becomes a serious limitation as soon as 
we want to extend our work to include animal experimentation. There 
are many reasons for expecting such extensions to become important 
within the near future Let me mention just two of them (1) Some 
of the psychophysiological work that seems desirable, from a psycho 
physical pomt of view, will necessarily have to be done with anim als 
(2) Many intervening variables or hypothetical constructs (or what- 
ever names they may go by) in learning theory are just the sort of 
things we are measuring every day in our human laboratories sub 
jective variables like pain and similarity For well known reasons we 
usually want to investigate learning as dependent on such variables 
on the animal level, and accordingly we would need techniques for 
measuring them in animals too 

For these reasons I would like to draw attention to another psycho 
physical tradition, initiated long ago by Thurstone (1927a h) on the 
basis of test theory and developed m quite different directions These 
methods may be called tiidirect methods, since they are based on a 
set of assumptions intervening between the experimental data and 
tlie final scale Experimentally these methods are sometimes even 
better adapted to animal than to human experimentation The kind 
of data for which they have been developed is in the form of right or 
wrong solutions, or of preference in a choice situation This is precisely 
the sort of data you obtain in maze performance and other situations m 
which animal behavior is studied The usefulness of these methods 
for measuring certain “underlying “ not directly observable variables 
(response strength) in experiments on human learning has recently 
been convincingly demonstrated by Bjorkman (1958) The pomt I 
want to make here is that these methods might be used for measuring 
‘subjective" variables (or whatever you may prefer to call them) m 
animals which cannot be asked what they experience (with human 
subjects it would be superstitious not to ask them) We shall have 
to make some assumptions, for example that the similarity principle 
is valid for the rat, and then we can proceed to investigate psycho 
logical mechanisms Let me mention here once more the possible 
simplicity of psychological relations when we investigate them on 
their proper (that is, psychological) level The important thing to do 
in order to discover a simple relation that may exist between per 
formance and motivation is to measure, say, amount of thirst— not 
hours of water deprivaUon 

One obvious step now would be to scale subjective variables, with 
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human subjects, by the techniques of both sorts of psychophysical 
measurement, the direct md the indirect methods If in a number 
such situations we obtim the same results, then we should feel more 
confidence in using the indirect methods in those casts where only 
these methods can be used 



rig 6- A comparison between two scale* obtained for tbe esthetic value of 
handwriting specimens. The "interval scale" obtained by one of the indirect 
methods (pair comparisons) is represented by the ordinate and the ratio scale 
obtained by a direct method (ratio estimation) is represented by the abscissa- 
Tiie curve represents the logarithmic relation to be expected if variability is pro- 
portional rather than constant. 


Only few experiments of this sort have been undertaken. Those 
I know about were earned out in our laboratory (Bjorkman, 1959 
"Ekman, Xunnapas and Berglund, 1961) Some unpublished data 
from the latter study may serve as an illustration. Figure 6 shows 
results from experiments in which the esthetic value of handwriting 
specimens was measured. In this graph the interval scale y constructed 
by an indirect Thurstone method (pair comparisons) is plotted against 
the rabo scale x from a direct Stevens method (ratio estimation 
scale values computed according to Ekman, 1958) The “interval 
scale” was constructed on the assumption that variability is constant 
(Thurstone s Case V) If instead, it is proportional to the average 
scale value of the stimuli then it may be shown that y a+ b log x 
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(c£ Bjorkman, 1960) This function is graphed m Fig 6, and the 
at of the data shows that the same scale (apart from an arbitrary 
zero point) may be constructed by the indirect method (on the 
revised assumption of proportional variability) as well as by the 
direct method Several studies are now being performed with the 
aim of investigating the general relations between direct and indirect 
scaling methods The ultimate purpose of these studies would be 
to find a common rationale, so that essentially the same scale can be 
constructed by either type of method 
When more experiments of this sort have been carried out, we may 
proceed, with some confidence, to measure subjective pain, and thirst, 
and sexual drive, and brightness and — why not — esthetic value in 
infrahuman species Whether, on the animal level, you prefer to 
call them "underlying” or “subjective,” is of minor concern 
Finally it may be mentioned that even in human perception there 
are many psychological variables that cannot be directly observed 
The variable intensity of that process that manifests itself in certain 
figural reversals has recently been successfully studied by Kunnapas 
(1961) by means of an indirect method Similar studies of die con 
trnuous, more or less rhythmical process which is responsible for 
fluctuations of attention" are now in progress 


Summary 

This paper includes an introductory discussion of die metric 
properties of the presumed ratio scales obtained by “direct” psycho 
physical scaling methods Reference is made to some recent expen 
ments demonstrating the additivity of such scales 
Three main types of problems, which may be investigated by psy 
chophysical methods, are outlined 
X Stimulus response relations, where response is a subjective van 
able, constitute a classical psychophysical problem formulated long 
ago by Fechner The general applicability of the power function to 
this kind of relation, which has been demonstrated especially by 
Stevens and his co workers, is pointed out and illustrated by a new 
expenment concerning the brightness of monochromatic light 
2. The possibility of investigating certain psjchophjsiological rela 
lions is discussed On the basis of existing information it is tentatively 
concluded that stimulus discrimination and, on the odicr hand, per 
cephon of subjective magnitude may be psjchophysiologically quite 
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different processes, the latter being the Gnal outcome of a stcp-by-stcp 
process of transformations. _ . 

3. Attention is drawn to the possibility of investigating certain intra- 
subjectivc relations, relations between subjective variables. A series 
of experiments concerning subjects e similarity is chosen as an illus- 
tration of this approach. It was demonstrated that similarity is a 
simple mathematical function — -without empirical constants — of the 
subjective properties of the percepts being compared. These results 
seem to indicate the possibility of discovering psychological mecha- 
nisms which may turn out to be rather simple when they are studied 
on the proper— that is to say, subjective — level 

Finally some possible extensions of psychophysical work are dis- 
cussed with special regard to animal experimentation. Measurement 
of “subjective” or perhaps rather “underlying” variables in animals 
will necessitate the use of “indirect" psychophysical methods of the 
type introduced by Thurstone, and it thus becomes an interesting task 
to study the relation between these and the “direct" methods generally 
used in human experimentation. Results from a preliminary experi- 
ment of this type arc reported. 
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Psychophysiological 

Models 


Preface 

This is not a written version of the talk I gave at the Symposium In 
that talk, I summarized experimental data on information processing capa 
bihties of human beings and described a model for categorizing patterns 
of stimulation The data were intended to set boundary conditions on the 
communicative capabilities of the over all human organism The model 
was intended to focus attention on problems that anse in learning to handle 
redundant messages It was evident from the discussion that, m order to 
bring the data and the model into resonance with the thinking of many of 
the members of the group, I would have to relate both data and model 
more closely than I did to actual, observable physiological processes For 
the material presented, however, physiological interpretations are still to be 
formulated 

Instead of insisting on the propaedeutic values of those data and that 
model, therefore, I am turning, in these written pages, to a smaller area in 
which it is possible to suggest fairly specific relations between parts of a 
model and observable neural processes The conviction that guides my 
effort remains the same as it was before a conviction that there are value 
and promise in ps> chophysiological models 

f sycnopnysio’iogica’i models seem to me to oner the possibhrty ot 
bringing together in productive interaction (1) quantitative and 
qualitative findings of experimental and (even} clinical psychology, 
(2) substantive fads of neurophysiology and nemoanatomy, (3} ab- 
stract theorems and principles of mathematics and logic, and (4) 
insights and formulations achieved in the course of developing the 
technology of information processing and communication Perhaps 
because I value that possibility very highly, Jt seems to me that it 
currently commands relatively little effort — relatively little, that is, 
as compared with efforts in purely psychological and purely physio 

49 
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logical research or in psychopliysiological research that docs not 
involve mathematical models or ideas from communication theory, 
information theory, or the >oung field of "artificial intelligence. 
Happily, however, several papers presented at the Symposium and 
several parts of the discussion contributed strongly toward the devel- 
opment of, and, I hope, an ascendency for, psychopliysiological 
models 


Procedure and Experience 

Audio analgesia is a phenomenon only recently discovered and 
not yet well understood (Gardner and Lickhder, 1959a-c) The 
essential fact is that a procedure involving acoustic stimulation but no 
conventional anesthetic or analgesic agents has made it possible for 
patients who previously had required nitrous oxide or a local anes- 
thetic to undergo dental operations without serious pain or unpleas- 
antness 

The procedure is briefly this The patient wears earphones and 
controls his own acoustic stimulation through a control box held in 
his lap It has two control knobs, one for music and the other for 
a rushing, roanng sound derived from “white” noise At the begin- 
ning of the session, the patient selects the music he wants to hear — a 
stereophonic tape recording — and adjusts it to a volume suitable for 
ordinary listening When the dentist starts to work, or when his 
work causes any discomfort, the patient turns up the volume of the 
music As soon as there is a trace or forewarning of pain, the patient 
turns the noise knob It controls the level of the rushing, roanng 
"waterfall” sound. The over all sound pressure of the noise may be 
set as high as 116 decibels above 0 0002 microbar The intense noise 
drowns out or suppresses the pam in a great majonty of cases 
There is no longer any question of the efficacy of the procedure 
in dental operations Dr Wallace J Gardner, of Cambndge, has 
used the procedure with more than 1000 patients who previously 
required chemical agents The audio analgesia was completely effec 
bve with 65 per cent, and effective enough with another 25 per cent 
that no other agent was required and the acoustic procedure was 
preferred for subsequent visits Ten other dentists have now had 
fairly extensive experience with the procedure (more than 5000 pa 
bents in all), and their experience has paralleled Gardners very 
closely The operations have included cavity preparation (drilling), 
grinding, and extraction. Gardner has had about 200 successful ex' 



ON PSYCHOPHYSIOlOGiCAl MODELS SI 

tractums including several multiple extractions— and no failures 
n short, m the dental office, the acoustic procedure appears genu 
meiy to suppress pain, to provide analgesia in the sense of standard 
dictionary definitions of that word 
Thus far, there is not enough experience with the procedure in 
nondcntal clinical situations to justify conclusions The mailable 
evidence, provided by physicians who have used the same sound 
produenig equipment as was employed in the dental observations, is 
partly positive, partly negative The procedure was effective m 
several minor surgical operations but not m all It was wholly 
sufficient m five cases of labor and childbirth but required supple 
mentation in three others It was ineffective in suppressing mtrac 
table pain associated with cancer 
In the experimental laboratory, it is difficult to make a dramatic 
demonstration of suppression of pain by sound, but it is easy to 
create a discernible effect On the basis of preliminary experimen 
tation, it appears that the factor of tension relaxation must be intro 
duccd into the laboratory situation before the degree of effectiveness 
observed m the dental office can be achieved (Weisz 1959) The 
same statement cau be made, however, with reference to morphine 
and other pain relieving drugs There is little or no difference in 
effectiveness between morphine and placebo in a typically objective, 
analytical, laboratory context (Hill, ct al, 1952) 


Psychological Characteristics of Audio Analgesia 

Experience thus far gained has revealed several characteristics of 
audio analgesia for which one would like to have an explanation The 
characteristics are by no means precisely quantified, they are largely 
qualitative and clinical 

1 In the absence of acoustic stimulation and conventional analgesic 
or anesthetic agent, pain tends to build up regeneratively during in 
tervals of strong nociceptive stimulation 

2 When nociceptive stimulation xs withdrawn, after an interval 
of pain in the absence of acoustic stimulation and the conventional 
analgesic or anesthetic agents, die magnitude of the pain sensation 
decreases, but there is often a residual paw sensation, an after paw. 

3 When an intense auditory stimulus is turned on and held for an 
interval at constant sound pressure level, the loudness rises rapidly to 
a maximum, then equilibrates or recedes to a relatively steady level 
(“perstimulatory fatigue”) 
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4 Ordinarily, after the first 100 milliseconds, there is no positive 
after sensation in hearing, but a depression in sensitivity can be meas- 
ured for a period following moderate or intense acoustic stimulation 

5 In at least some situations, intense acoustic stimulation has a 
definite suppressive effect on the subjective magnitude of pain and 
on overt pain reactions 

6 In general, effectiveness in suppressing pain increases with sound 
intensity, decreases with the intensity of the noxious stimulation 

7 Relaxed patients experience more relief from pain than do tense 
patients The main role of the music m the procedure is to relax the 
patient before the operation is begun Except for hard-of bearing 
patients and children too young to manipulate the controls effectively, 
the patients with whom the audio procedure has been least effective 
have been patients who entered the situation in extreme tension or 
anxiety 

8 In the experimental laboratory, m a psychophysical context, one 
can often detect a reduction in the pain produced by an electric shock 
or other noxious stimulus, but it is only a modest reduction, much 
less dramatic than the effects observed in clinical situations or even 
in laboratory situations in which the psychophysical context and set 
for careful observation are replaced by conditions favoring relaxation 
and diversion of attention 

9 The procedure is more effective if the noise is turned up before 
pain develops than it is if the noise is withheld until pain is clearly 
present 

10 Stimulation by intense noise appears to have a more or less 
persistent after effect on pain In some instances, after long or intense 
exposure, it has been possible for patients to undergo ordinarily painful 
operations without further presentation of noise In other instances, 
patients have been able to follow the gradual reappearance of pain 
dunng the course of seconds following presentation of a burst of noise 

11 Pam may have some reciprocal effect on the loudness of noise 
or music, but it is not as marked as the effect of sound on pain 

12 Once a pain has been suppressed it can sometimes be kept 
suppressed by a weaker sound than was at first required to over 
come it 

13 Quite a few patients have fallen asleep while listening to music 
or noise dunng a dental operation Many have fallen asleep while 
still listening to music or noise m the interval immediately following 
an operation 

14 Individual differences in use of the acoustic procedure are very 
great Some patients require intense noise, others use only moderate 
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levels and report that pain suppression is fully effective Some pa- 
tients rely heavily on the music, turning it up to full volume instead 
of using noise Others use the noise control at every suggestion of 
pain 00 

15 The several dentists who have used the audio procedure have 
had approximately the same percentage of success 

16. Random noise is more effective in suppressing pain than other 
sounds tested, with the exception of random noise plus music 

17. Although the high frequency components of “white” noise must 
be attenuated if one is to produce a sound that most patients will 
accept and willingly adjust to a high volume level, the upper cutoff 
frequency of the noise must not be made too low or effectiveness m 
suppressing pain will be lost 

16 Some patients have reported that, while using the audio pro 
cedure, they felt something they could call pain, but that it didn’t 
hurt the way pain ordmanJy does ” 

19 Against certain types of pam, such as the pam produced by 
an electrodepilatory needle, photic stimulation appears to be more 
effective than acoustic stimulation (Baruch and Fox, 1959) 

20 Distraction of attention from pain to something else appears 
to reduce pam 

The items just listed do not provide a basis for curve fitting for 
the calculation of coefficients of correlation, for any of the activities 
usually associated with the development and testing of mathematical 
models Their qualitative nature may appear, indeed, to bar them 
from the realm of models Yet it seems worth while to try to organize 
the picture m some other way than merely to list the 20 statements 


"Explanations" of Audio Analgesia 

Almost surely, several factors operate together w producing audio 
analgesia Their relative weights or importances vary from patient to 
patient and situation to situation. These seven seem usually to be 
the main ones (1) direct suppression of pain sensation, either the 
primary pam or the secondary elaboration of, or reaction to, the pri- 
mary pain [reference is made here to Beechers (1957) distinction), 

(2) relaxation, with consequent reduction of the elaboration reaction, 

(3) distraction of attention from pam, with consequent reduction of 
the apparent seventy of the pam, (4) masking of the dentists dnll 
(in operations in which a drill is used), with consequent reduction in 
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the anxiety conditioned to the drill through earlier painful experi- 
ences, (5) improved communication with the dentist (through the 
changing levels of music and noise, which the dentist hears through 
his own earphones or monitor loudspeaker), with consequent reduc- 
tion in fear, anxiety, or feeling of helplessness, (6) active role for the 
patient (who now can control a massive part of his experience in the 
dental chair), with consequent reduction in fear, anxiety, or feeling 
of helplessness, and (7) suggestion. 

The seven {actors have been cited, individually and collectively, 
as “explanations” of audio analgesia Clearly, they arc not explana- 
tory in the reductionist sense, they do not describe any mechanism 
or process Instead, they relate characteristics of the audio procedure 
to generalities that cover other situations than audio analgesia, and 
thus perhaps lend some plausibility to what might otherwise seem 
wholly implausible This land of “explanation” seems unsatisfactory, 
but it is about all that can be achieved without recourse to hypotheses 
that involve the neurophysiological substratum of behavior 


Neurophysiological and Neuroanatomical Constraints 
on Models of lntersensory Interaction 

Neurophysiological and neuroanatomical researches are making 
such rapid strides toward an understanding of interactions among 
sensory modalities in the reticular formation and related centers of 
the bram that it may soon be possible to specify precisely the loci 
involved m the suppression of pain by sound. For the tune being, 
however, we may concentrate on process and neglect considerations 
that point to particular regions or nuclei, noting only that there is 
abundant opportunity, at levels from the upper boundary of the 
medulla to the cerebral cortex, for sound signals and pain signals 
to come together What are some of the general neurophysiological 
and neuroanatomical considerations that may serve as guide lines or 
constraints in the development of a model of the interactive process? 

L The organization of the nervous system is characterized by focal 
ized excitation or facilitation and diffuse inhibition or suppression 
(The papers presented at this Symposium by Ratliff and Mountcastle 
provided beautiful examples of this principle of organization ) Where 
two or more sensory systems come into a common region but do not 
map themselv es, point for point, into a common projection, therefore, 
we may expect the effects of one system on another to be inhibitory 
or suppressive rather than excitatory or facilitory 
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2 It is necessary for the sake of stability that, on the average, inhibi- 
tory processes have higher gains than excitatory processes at high 
ievels of activity, and that excitatory processes have higher gams than 
inhibitory processes at low levels of activity Otherwise, the brain 
would either run down” into quiescence or undergo a regenerative 
chain reaction” up to saturation 

3 The temporal parameters of excitatory processes are usually 
shorter than those of inhibitory processes This fact may be related 
to the localization of the former and the diffuseness of the latter 

4. There is measurable activity in the neural substratum (for ex 
ample, “spontaneous activity”) even in the absence of contemporary 
stimulation and in the absence of (reliable) sensation (The word 
reliable” is introduced to avoid argument here about thresholds ) It 
seems reasonable to postulate an underlying pain process that has 
continuous existence, though at low levels, during intervals in which 
there is ordinarily no report of pain 


A Psychophysiological Model of the Process 
Underlying Audio Anaigesia 


The interests of simplicity and generality pull w opposite directions. 
Because the facts to explain are so few and so qualitative, there is 
justification, at best, for only a very simple modeL On the other hand, 
we may expect— we know, in fact, that there are— interactions among 
all the sense departments, and it seems restrictive to limit considera- 
tion to only hearing and pain, or to only a given pattern of facihtory 
and inhibitory coefficients The compromise has been to formulate 
a general model, one with a variable number n of channels, each 
channel having variable parameters, but to exercise in the trials to 
be described here only a few specific realizations in which n equals 2. 
The effect, therefore, is clearly on the side of simplicity It is, in 
fact, deliberate oversimplification, to represent a neural mechanism 
that involves several cascaded networks, each containing thousands 
of nerve cells characterized in part by all-cr none response, by a mere 
pair of channels involving only a few continuous variables The 
object, however, is not to construct a brain. It is to construct a con- 
ceptual network (1) that is easy to understand, 2) that bears a recog- 
nizable (and potentially improvable) resemblance to part of the 
nervous system, and (3) that will behave in ways that manifest the 
characteristics of audio analgesia. 

The model incorporates some, but by no means all, of the ideas 
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fig. i 5chemalic diagram of the model. The nodes represent variables, the 
branches operations. The stimuli are *i (sound) and r; (pain) The responses 
are r y (loudness process) and r. (pain process) The variables f rt , /,*, / tl , and 
are self and cross-feedbacks denied from fj and r 3 through the successive 
operations of dipping (cw), smoothing (*<j), amplification ( +gn), and addition 
The over lU effect is to produce two stimulus-response channels, each subject to 
both positive and negatn e feedback from its own response and from the response 
of its neighbor 


about the pain mechanism that hate been described by Hardy, Wolff, 
and Goodell (1952), Beecher (1957), and Melzack, Stotler, and Liv- 
ingston (195S) 

The model is illustrated schematically in Fig. L The stimulus or 
input is si The response or output is rj The action of the network 
can be described most readily if we assume, for the moment, that the 
node marked with superposed + and X signs is simply an adder, 
as all the other nodes in fact are. 
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The response r x is formed by adding together the stimulus s,, the 
self feedback signal f, u and the cross feedback signal f, 2 The self 
feedback signal /„ is formed from r, by dipping (c n and c,«) 
smoothing (*n and s 12 ), amphfymg (g n and -g,,), and adding 
(at the node labeled /,,) The characteristic of a typical clipper is 




fig 2 Details of operations in the model (a) Behavior of a dipper that passes 
only the parts of the signal that exceed a preset threshold T (6) Conventional 
analog integrator with negative feedback, equivalent to a resistance-capacitance 
smoothing filter, with tune constant determined by the setting of the potenh 
ometer The amplifier following the integrator makes the gain of the network 
positive, (c) A digital smoothing circuit The output is the weighted sum of 
the input { weight 1 — t) and the quantity ( weight t) accumulated in the rescr 
voir The quantity in the reservoir is a remnant of past values of the input 
The result is dosely similar to that provided by (6) (d) The network that 

provides a weighted sum of addition and multiplication at the nodes marked by 
superposed + and X signs in Fig 1 Appropriate adjustment of the potenti 
ometers provides the response r = a(s +/«+/«) + (1 a) {si.) 


shown schematically in Fig 2 a The threshold Tjo of cj 2 is higher 
than the threshold T lt of c tI That gives the posibve feedback, path 
the advantage at low signal levels An analog smoother is illustrated 
in Fig 2 b, and a digital smoother is illustrated m Fig 2c In the 
latter, r is a multiplicative coefficient and (1 — r) is its comple- 
ment, abs is an operator that delays the signal one interval of cal 
dilation tune and then replaces the signal by its absolute magnitude 
Both Figs 2b and c approximate the action of a simple resistance 
capacitance smoothing filter The time constant of Sj 2 is longer than 
that of s« That makes the negative feedback more sluggish than 
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Fig 1 Schematic diagram of the model. The nodes represent variables, the 
branches operations The stimuli are #i (sound) and Sg (pain) The responses 
are fj (loudness process) and r 2 (pain process) The variables f, v , f t * f cl , and 
fa are self and cross-feedbacks denved from r i and r 2 through the successive 
operations of clipping (c«/), smoothing (*«j), amplification (+£•/), and addibon 
The over all effect is to produce two stunulus-i espouse channels, each subject to 
both positive and negative feedback from its own response and from the response 
of its 


about the pain mechanism that have been described by Hardy, Wolff, 
and Goodell (1952), Beecher (1957), and Melzack, Stotler, and Liv- 
ingston (1958) 

The model is illustrated schematically in Fig 1 The stimulus or 
input is The response or output is r x The action of the network 
can be described most readily if we assume, for the moment, that the 
node marked with superposed + and X signs is simply an adder, 
as all the other nodes m fact are. 
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spiff 0 r rr° Tl w f0m>C ? by add " !S ,0gethor *« s,, tie 

If feedback signal /„, and the cross feedback signal f ci The self- 
feedback signal /„ ts formed from r, by dipping ( Cll and c , 2 ) 

ampUymg ( S“ and -Sis), and adding 
(at the node labeled /,i) The characteristic of a typical clipper is 




(d) 


Ha 2. Details of operations m the model (a) Behavior of a clipper that passes 
Only the parts of the signal that exceed a preset threshold T (b) Conventional 
analog integrator with negative feedback, equivalent to a resistance-capacitance 
smoothing filter, with time constant determined by the setting of the potenti 
Ometer The amplifier following the integrator makes the gain of the network 
positive (c) A digital smoothing circuit The output is the weighted sum of 
the mput (weight 1 — r) and the quantity (weight r) accumulated in the reser 
voir The quantity in the reservoir is a remnant of past values of the input 
The result is closely similar to that provided by (b) (d) The network that 

provides a weighted sum of addition and multiplication at the nodes marked by 
superposed + and X signs in Fig 1 Appropriate adjustment of the potenti- 
ometers provides the response r=n(s + /* + /•) + (1 -*• 

shown schematically in Fig 2a The threshold T 12 of c 12 is higher 
than the threshold T u of c lt That gives the positive feedback path 
the advantage at low signal levels An analog smoother is illustrated 
»n Fig 2b, and a digital smoother is illustrated in Fig 2c In the 
latter, r is a multiplicative coefficient, and (1 ~ r) is its comple- 
ment, abs is an operator that delays the signal one interval of cal- 
culation time and then replaces the signal by its absolute magnitude 
Both Figs 2b and c approximate the action of a simple resistance- 
capacitance smoothing filter The time constant of s l2 is longer than 
that of s tl That makes the negative feedback more sluggish than 
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the positive feedback Finally, g tt is greater than g n That gives 
the advantage of greater gain to the negative feedback patlis when 
tlie signal level is much greater than the clipping thresholds 
The signals /,«, f t u and arc derived in ways that correspond 
directly to the one just described We have in sum, therefore, both 
positive and negative feedback from each channel to itself and from 
each channel to every other The only thing that remains to be 
described is the node with the superimposed + and X signs 

Consider two neural channels that come together and impinge 
upon a thud The response z of the third is a function of the levels 
of activity * and y of the incoming two z — r(r, y) In what is 
perhaps die simplest concept, F is the sum z — x + y In another 
interpretation almost equally familiar, F is the product z — x y 
To achieve a helpful but limited degree of generality, wc may com- 
bine the two notions linearly and let the equation z — F(x, y) be 
s=*a(x+ y) + (1 — o) (x y), with 0 i? a 1 That is essen 
tially the interpretation of F used m the model In the model how- 
ever, three “neural channels," not two, impinge upon the outgoing 
channel 

The interpretation actually cmplo>ed in the model is shown in 
Fig 2 d The three incoming signals are s, f 4 , and f c The response r 
of the outgoing channel is a(s + /, + f e ) + (1 — a)(s /,) That 
limits the nonlinear (multiplicative) action to the self feedback signal 
f, and the input s To a degree controlled by (1 — a), the self feed 
back signal turns a valve through which the input flows to yield the 
response The input variable s is never negative 


identifications between Variables of the Model 
and Variables of Audio Analgesia 

The interpretation. o£ the model is direct. The two channels are 
parts of the auditory and pain systems The input variables s t and 
*2 are the strengths of acoustic and nociceptive stimulation, respec 
Uvely The output variables r 2 and r 2 are the strengths of the sound 
and pain processes, respectively The subjective magnitude of the 
sound (loudness m sones) is a monotonic, nondecreasing function of 
fj, and the subjective magnitude of the pain (subjective painfulness 
in dols) is a monotomc nondecreasing function of r 2 For present 
purposes we may say that loudness L is equal to — c x when r x ^ c u 
and L is equal to 0 when r x ^ c u and we may say that subjective 
painfulness P is equal to r 2 — c 2 when r 2 ^ c 2 , and P is equal to 0 
when r 2 g c 2 To simplify things further, we may let c* s: 0 For 
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the pain channel, however, it seems better to assume tint there is 
no ordinarily reported pain until the pam process exceeds a threshold 
significantly above zero 

Tile thing in the model that corresponds most directly to “level of 
tension in a patient is the sum of the feedback signals Note that 
tins is not a unitary thing Tension, as the word is used m cluneal 
contexts, is a global term In the model, even oversimplified as it is, 
there are two “tensions* in the auditory $>stem and two more m the 
pain s>stcin One member of each pair (“anxiety,” perhaps) has a 
direct effect upon the gam of the channel, the other has only an 
indirect effect 

The clipping threshold T 2 i associated with c 2 i m the model bears 
a relation to tolerance for pam in the actual patient If T 2 i is high 
tlie regenerative growth of pam docs not begin until the level of sub 
jeetive pam is fairly high, if T 21 is low, the regenerative build up 
begins at once The thresholds associated with the other clippers 
may be given corresponding interpretations 


Behavior of the Model in Response to Stimulation 

The model has been realized in two forms a digital computer pro 
gram and an analog computer setup Because the digital computer 
is a small, slow maclune, the digital program has been run m only 
a few configurations, all with a = 0, which specifies linearly additive 
combination at the nodes at which stimuli and feedbacks are com 
bmed The digital program, however, can be changed from two 
channels to twenty by writing “20" on a typewriter, and it thus has 
the advantage of growth potential The analog computer set up pro 
duces results much faster It has been run with both linearly additive 
and partially multiplicative combinations at the crucial nodes 

In describing the simulation tests, we shall refer to the variables 
as acoustic and nociceptive stimuli, sound and pam responses, and 
so on That seems preferable to the alternative of designating them 
always by letters because (1) it brings to the foreground the linkages 
between variables of the model and variables of the situation repre 
sen ted and (2) there seems to be little danger of confusion between 
the two realms of discourse 

Behavior of the digital simulation 

The behavior of the digital program in a long run with three modes 
of stimulation is shorn m Fig 3 The parameters of the model for 
this run are given in Table 1 
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Hg. 3. Behavior of the digital simulation, (a) Responses to a senes of four 
noxious stimuli. The stimuli occur at the intervals 9-19, 39-49, 69-79, and 
93-109 sec. Their relative strengths are I, 2, 4, and 8 (b J Responses to a 
corresponding senes of four acoustic stimuli. Note the depression of the already 
subliminal pain process. In (e) acoustic and nociceptive stimuli are presented 
together The acoustic stimulus is on at strength 4 during intervals 309-339, 
369—399, 439-439, and 489-499 see. The noxious stimulus is on during intervals 
319-329, 369-379, 429-439, and 489-499 sec- 

responses of both the auditory and pain channels are shown The pain 
response nses rapidly as soon as the nociceptive stimulus is appbed, 
continues to build up during the course of stimulation, and drops 
rapidly but not to zero level, when the stimulus is removed. The 
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Table 1. Parameter* of Digital Simulation for Te*t* Shown in 

Fig 3 
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nociceptive stimlus has a small effect on the loudness process A 
corresponding effect of actual pain on actual loudness might be made 
noticeable in a suitably designed psychoacoushc experiment In the 
dental situation, however, such a small suppressive effect would not 
be noticed 

For Fig 3 b, only acoustic stimulation was employed The appli 
cations are graded, as they were in Fig 3 a While the stimulus is 
on, the loudness process holds almost a steady level When the stim 
ulus is terminated, the loudness process falls to subzero values This 
is analogous to the “poststimulatory fatigue” observed in auditory 
tests ( Causs£ and Chavasse, 1947, Gardner, 1947, Luscher and Zwis- 
locb, 1947, de Man*. 1939) 

For Fig 3c, both acoustic and nociceptive stimulation were em- 
ployed We may examine the effect of the sound on the pam by 
comparing Fig 3c with 3a It is the temporal course, rather than 
the magnitude, of the suppressive effect that is of primary interest 
[The effect can be made as marked as is desired, of course, by manipu- 
lating parameters of the model The magnitude of the effect observed 
with this particular configuration of parameter values (Table 1) is 
characteristic of a very successful application of the audio procedure 
to a favorable subject] It is noteworthy that the effectiveness of the 
acoustic stimulation m suppressing pam is greatest when the acoustic 
stimulus is applied before the painful stimulus, and that there is a 
small “twinge” of pain at the beginning of stimulation when the two 
stimuli are applied simultaneously Corresponding effects are noted 
repeatedly m dental operations 

Behavior of ihe analog simulation 

Figure 4 shows a sample of the behavior of the analog simulation 
Table 2 gives the corresponding parameters Tliroughout the record. 
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Tme (sec) 

rig 4 Behavior of the analog simulation. The chart shows two contiguous 
100- sec intervals of stimulation and response The acoustic and nociceptive stimuli 
are marked *i and » 2 , respectively, and the corresponding responses, fi and f 2 


Tobte 2. Parameters of Analog Simulation for Tests Shown in Fig 4 
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a fairly strong nociceptive stimulus is turned on and off periodically. 
A relatively weal: acoustic stimulus is turned on for a prolonged 
interval covering the middle portion of the sample and, again, for short 
intervals near the end of the sample With the parameters of Table 2, 
the loudness process equilibrates rapidly and then holds a steady level 
A corresponding temporal course has been observed for actual loud- 
ness by Hood (1953) and Egan (1935), m psychoacoustic experi- 
ments When the stimulus is terminated, the loudness process falls 
to subzero values 

The suppressive effect of the acoustic stimulation appears on two 
quite different time scales The gradual, progressive suppression pro- 
duced by the long presentation of the acoustic stimulus is clearly 
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evident in tlie middle part of die record A more immediate, but less 
marked, effect can be seen at die onset of acoustic simulation The 
latter effect is clearest in die traces near the end of the record, where 
die sound is turned on and off during elch of several bursts of pun 



Fig S Behavior of the analog simulation The layout Is similar to that of Fig 4 
Compare the second pain response during quiet with the first and third dur g 
noise. The fourth and fifth paw responses are entirely suppressed. 


stimulation The great difference between the effect producible in a 
moment and the effect that can build up over a period of time should 
be borne in mmd 

Figure 5 shows another sample of the behavior of the analog simu 
Jabon, this time with the parameters of Table 3 In Fig 5 the acoustic 


Table 3 Parameters of Analog Simulation for Tests Shown in Fig 5 
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stimulus is not strong enough relative to the painful stimulus, to 
suppress the pain process rapidly 



M J. C. R. UCKUDEB 


The painful stimulus and tlie acoustic stimulus are presented exactly 
together in every stimulus cycle but the second In the second, the 
acoustic stimulus is withheld The pain is clearly worse in the ab- 
sence than in the presence of the sound In the latter part of the 
record, the successive presentations of the sound gradually suppress 
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Table 4 Parameters of Analog Simulation for Tests Shown in Fig 6 
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lus goes on and off periodically Its intensity while on is constant 
An acoustic stimulus also goes on and off, but at irregular intervals 
The intensity of the acoustic stimulus is varied several tunes during 
the run 

At the beginning of line I in Fig 6, the acoustic stimulus is off The 
pain process is going through cyclic variations m which it builds up 
to quasi saturation when the noxious stimulus is on and recovers par 
bally while the noxious stimulus is off The quasi saturation level is 
determined by the interplay of the positive and negahve feedbacks 
Each tune the painful stimulabon is removed, the pain process drops 
suddenly, but only for a short tune Then for a short time it holds 
a high level (again, determined by the interplay of the feedback 
signals), and then it declines exponentially toward its quiescent 
operating point 

A brief burst of intense sound (intensity 9) at A in Fig 6 has a 
barely discernible immediate effect on the pain process, die effect on 
the recovery phase is somewhat more evident 
The intense sound stimulus beginning at B has only a minor effect 
during the pain stimulus interval because of the great intensity of 
the pain stimulus As soon as the pain stimulus is removed, however, 
the sound dnves the pain process down rapidly Note, particularly, 
that, once the pain process has been suppressed, even die strong noci- 
ceptive stimulus is unable to break through the suppression provided 
by die sound Only when the sound is turned off does the paw proc 
ess rise Then, by the second cycle, it is back to its quasi saturation 
and parbal recov ery behavior That behavior continues through eight 
pain cycles (lines 2 and 3) without further acoustic stimulation 
When the sound is turned on again at C (intensity 9), the painful 
stimulus is off The sound rapidly suppresses die pain process, but 
the painful stimulus comes back on before full suppression is achieved. 
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The burst of pain stimulation elevates the pun process but does not 
succeed in raising it to quasi saturation because it is working against 
the influence of die sound During die second pain stimulus free 
interval, the suppressive effect of the sound fully depletes the pain 
process, and there is no more pain activity until the sound is turned off 
Beginning at D, the record illustrates effects of bursts of sound 
presented during die intervals of painful stimulation As before, the 
effects are noticeable mainly in the recovery intervals 

Thus far, during the run, the acoustic stimulus has been cither at 
intensity 9 or at intensity 0 At E, shortly after an onset of the pain 
stimulus, the acoustic stimulus is turned on again at intensity 9 By 
the second pain cycle, it has fully suppressed the pain process Then, 
at the beginning of line 5, the strength of die acoustic stimulus is rc 
duced to 6.5 Even at the reduced level, it is effective in preventing 
the nse of the pain process during the following cycles During one 
of them, at F, there was difficulty with an “earphone lead,” and the 
pam process almost was able to resume activity The difficulty was 
corrected quickly, however, and the pam process subsided. 

At G, the acoustic stimulus was again reduced in strength, to 3.5 
At this new level, the suppressive effect was not great enough to hold 
down the pam process, which returned to its cycle of quasi saturation 
and partial recovery 

Jvext, at H, the acoustic stimulus was turned off, and then back on 
a gam at intensity 6 .5 As before, it gradually overcame the pain and 
reduced the pam process to near zero level. Again the sound intensity 
was reduced — to 5 at I and then to 4 at J — and agam the weaker 
acoustic stimulation was effective in holding the pain process in abey- 
ance through several cycles When the weaker acoustic stimulus was 
then turned off at K, the pam process resumed its familiar course 
Note, particularly, now, at L, that when the weaker acoustic stunu 
lus was turned on agam, it was unable to have much effect on the 
pam It could hold the pam process down when it caught the pam 
process m a suppressed state (/), but it could not overcome the pam 
process when the latter was fully active (L) The run ends with 
several cycles of quasi saturation and partial recovery despite the 
presence of the not qintc-sufficient acoustic stimulation. 


Evaluation of the Model 

The model is admittedly preliminary, tentative, and oversimplified 
Little comes out of it that was not put into it other than the demon 
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stratum tint the network actually docs what intuition says it must do 
Of course, the model can relate the behavior to the parameters more 
accurately than nn intuitive grasp of the process alone can do, but 
that is not now in important capability because individual differences 
among actual patients correspond to wide ranges of parameter values 
If we count all the possibly variable parameters, we find that the two 
channel model with addition plus multiplication at the critical node 
has about 26, winch is greater thin the number of facts we have 
against which to check the behavior of the model From one point of 
view, that is a disastrous situation Train another, one sees that most 
of the 26 parameters need never be varied and that the variations 
that are made are constrained by predetermined patterns, but those 
observations do not wholly overcome the objection that there are 
too many degrees of freedom ” 

Is such a model, then, worth while? Does the fact that character 
isUcs of its behavior parallel characteristics of actual audio analgesia 
give it any value or significance? 

In approaching that question, it may be helpful to review the corre 
spondence between model and actual characteristics Of the twenty 
characteristics we listed, the behavior of the model directly displays 
the first twelve Implicit in the simple two channel version of the 
model are places for three more of the characteristics (Nos 13, 14, 
and 15) Increasing the number of channels would accommodate 
four more (Nos 16, 17, 18, and 19) An extension would take care 
of the remaining characteristics (No 20) 

Some of tlie correspondences are obvious Others require mterpre 
tive notes 

7 Effectiveness of suppression increases with relaxation decreases 
with tension In the model, relaxation tension is identified with the 
feedback signals, which can be monitored individually In versions 
of the model involving multiplication at the critical node, the gam of 
the pain channel depends upon the level of self feedback That level, 
in turn, depends upon the level of cross feedback from the auditory 
channel 

8 The immediate suppressive effect (which corresponds to the 
effect that can be observed in a psychophysical context) is less marked 
than the effect that develops gradually 

12 When the pain process and the feedback signals are at low 
levels, the gam of the pain system is low, and the suppressive influ 
ence of the sound has less to overcome 

13 If we identify sleep with extremely low levels of tension 
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(Jacobson, 1933), and if we focus attention on the pain channel, we 
see that, in a few instances during the stimulation runs, the audio 
procedure induced sleep That, of course, is an oversimplification, 
but it corresponds well with experience in the dental office, and it may 
go even further Preliminary observations indicate that infants may 
be put to sleep by noise of the type used in the audio procedure 
14. Varying the parameters changes the amount of sound required 
to suppress a given amount of pain It seems reasonable to suppose 
that there are large individual differences among people m respect 
of the neural parameters that correspond to the simulation network 
parameters 


15 The dentist or experimenter enters the picture only through 
his manipulation of stimulus variables and of such parameters as 
relaxation tension. In the actual, cbm cal situation, that is an over- 
simplification, of course Nevertheless, the fact that appro ximatel y 
the same degree of success was achieved by the several dentists, all 
of whom were aware of the importance of relaxing their patients, is 
just what would be expected on the basis of the model, and not at 
all what would be expected if subtle suggestion, or even strong sug 
gestion bordering on hypnosis, were essentially involved. 

16-17 In a version of the model with many channels, several 
channels could be identified as auditory and several others as pain 
chan ° e J s To thieve effective suppression, with such a system, it 
would be necessary to stimulate many or all of the auditory channels 
The advantage claimed by random noise because of its continuous 
coverage of the frequency scale would then be reflected by the be- 
havior of the model. The advantage inherent in temporal continuity 
is, of course, reflected by the behavior of even the two-channel modeL 
lb in a multichannel model, the parameters would vary among 
the several pain subchannels Suppression should then change the 
quahty of experienced pain if it did not abolish the pain entirely 
. , a ““^channel model in which parameters varied among the 

channels, one seme modality would be likely to have an advantage 
Su PaiI J mother sense modality against another 
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system would have a crude analog of attention, and its “attention" 
would bo distractible. We may return, now, to the question of the 
value of such a model as the one we have been discussing 


The main value, it seems to me, stems from the role of the model 
as an organizer The model pulls a variety of facts together into a 
compact diagram, and it interrelates them, one to another The model 
consolidates the experience thus far obtained and makes it easy to 
go on to extensions and ramifications without losing ones grasp on 
the things one has already dealt with 
A second value that is important m the context of audio analgesia 
stems from the fact that, although the model displays rather subtle 
and complex behavior, it is mechanistic That might not be mipor 
tant m a reductionist, laboratory research context, but it is important 
in the clinical context The mode! is useful, for example, in providing 
a nonmentalishc rationale for the part of the procedure intended to 
facilitate relaxation The model even offers some help m clarifying 
the question of the role of suggestion it is often suggested that audio 
analgesia is a form of hypnosis Hypnosis and audio analgesia share 
the involvement of relaxation and the suppression of pain without 
the aid of chemical agents However, audio analgesia involves no 
special relation between patient and dentist, and it appears to be 
effective for a larger percentage of patients The model serves to 
turn the question of the relation between audio analgesia and hypnosis 
from semantics to experimentation One begins to see how the 
model might be extended to make a place for suggestion and to 
yield behavior similar to some that is observed under hypnosis One 
begins to ask not "Is audio analgesia hypnosis?” but “What specifiable 
processes underlie analgesia, and what specifiable processes underlie 
hypnosis?" 

That takes us immediately to the third hoped for value of the 
model that it will be useful in formulating experiments and m lead 
ing to new observations As mentioned earlier, the model suggests 
that we look at intersensory effects m general and try to relate to the 
model the many published results of that field That will be a diffi- 
cult and probably frustrating task More specifically, the model 
suggests an experiment to determine the effect of pain on loudness 
It makes connection with ongoing experiments on effects of sound on 
sleep and m inducing sleep And, finally, the model demands quan 
titative data on audio analgesia It makes it clear why experiments 
earned out in a traditional psychophjsical context are unlikely to lead 
to m teres ting results In doing so, it may possibly introduce new 
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vanables into psychophysics It seems likely, for example, that the 
subject’s state of tension relaxation may have a significant effect upon 
subjective magnitudes such as loudness, though perhaps not upon 
sensitivity to weak signals 

There is another side of the question of value, however, and it dis- 
turbs me deeply My primary impression of the Symposium was that 
many neurophysiologists are so engrossed in their substantive, purely 
neurophysiological work, and that it is reinforcing them so strongly 
through advances m understanding of the nervous system, that they 
do not have time for models There is no way of escaping the cost 
Even to set up and exercise such a simple model as the one described 
here required dozens of hours on the digital computer and several on 
the analog computer. It is not wholly clear to me that the time would 
not have been better spent with electrodes in a brain or even ear- 
phones on ears 

My ingrained predispositions toward the realm of models and 
toward the effort to correlate psychological and phjsiological data 
were given a sudden shock excitation by a recent letter from Vernon 
Mountcastle (1959), whose superb presentation at the Symposium 
greatly clarified my impressions of the reticular formation and the 
posterior group nuclei, where networks of the kind postulated here 
may be presumed to lie Mountcastle has found cells, both m the 
posterior group nuclei and in the cerebral cortex, which respond to 
nociceptive stimulation and whose responses arc suppressed by 
acoustic stimulation His finding raises the question, whether models 
can lead to experiments as fast as experiments can lead to models 


Summary 
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with the feedback, paths With values of those parameters that are 
reason ible on a prion grounds, the behavior of the model reflects 
many of the charactcnstics of audio analgesia 
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perhaps be talking in terms of “envelopes ot contraction and relaxa- 
tion,” or some such, we find ourselves saying, “The animal obtained 
food,” or “The subject made a judgment” We tend to classify re- 
sponses m relation to social significance. Even the statement, ‘The 
rat pressed the bar,” implies a httle teleology. 

The broadest relations available for study and analysis, then, are 
those between “things” in the world and the complex behavioral 
sequences they ultimately evoke — what perhaps had best be subsumed 
under the term, "conduct.” This set of relations between objects and 
conduct defines the domain of behavior theory. It also sets the stage 
for the first approach to any new communication problem, for when, 
now, we ask about the potentialities of a little-used sense channel for 
mediating messages, the broad relation between tilings and conduct 
has to be faced 


Two radically opposed approaches may be made to the assessment 
of cutaneous communication possibilities. If a person is interested 
in the “things” that go into conventional communication systems and 
in trying to utilize them in novel ways, he may ask, for example, what 
the skins capacity is for feeling the disturbances present in a tele- 
phone receiver or other electromechanical transducer, and interpreting 
them This, in fact, is what was asked years ago by those who got 
interested m ‘hearing through the skin” programs Their approach 
was reasonable, given as a starting point the considerations that the 
sbn is quite capable of apprehending mechanical vibrations and that 
the world contains vast quantities of hardware capable of transducing 
speech and music into mechanical vibrations of a power level that 
can be felt by the skin The argument was an evolutionary one — the 
eardrum which does so well at picking up and transmitting the fine 
dance of the air molecules present in speech, is a descendant of a 
cruder but not dissimilar tissue also capable of behaving with some 
efficiency m acoustic systems Why not simply tram the skrn to do 
substantially what the tympanic membrane does? There were many 
patient experiments, and enough came out of them to encourage per- 

finl 7, HUman SpCeCh ’ sultabIy ™PMed, was applied to the 
fingers After a sequence of 28 half-hour training periods, in one 

3 \ ? feoler t could '**"> to judge, with about 75 per-cent 

slm 7fi"' hlC ' 10 Sho,t sentcnces had been delivered to the 

ouTcom 'T 0urs oJ p ' actlce - sln gte words, presented with- 

these * . couIcl be rec °gnized about half the tune However, even 

lus rato nf CVels ,° f perfo ™ anco deteriorated if the talker changed 
° r ,£ anoU, “ tat in for him The efforts 

getting the skin to encompass a language not natural to it eventually 
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petered out, the approach by way of adapting existing hardware had, 
for all practical purposes, failed 

If one attach is to force a receptor system to perform m an un 
natural way by trying to adjust it to the worlds hardware, the other 
involves asking how the world and the things in it might be modified 
to get the most out of the senses What discriminations are possible 
for tile skin? What is the stuff out of which a cutaneous language 
has to be built if it is to be optimally utihzable by the skin? Having 
ascertained what sensory dimensions there are, one is in a position to 
catalogue the possible discriminations relative and absolute, that 
might be made along such contmua This should yield building 
blocks, collocations of stimulus properties, for coding One would, 
indeed, be in possession of the stuff out of which all possible cutane 
ous communication s) stems could be constructed, provided only that 
the dimensional anal>sis were to be exhaustive and systematic Then 
it would depend mainly on what had to be communicated— whether 
the “message" was a warning, directional or rate information, a more 
50p]usticated and elaborate formal ‘language," or whatnot — what 
stimulus arrajs were selected out for coding and use 
There are but a few stimulus dimensions of the first order available 
They arc easily listed locus, intensity, duration, and frequency, if one 
begins with repetitive mechanical impacts that is, vibration This is 
not the only possibility — the skin responds to thermal, chemical, and 
electrical stimuli as well— but mechanical vibration is the most promis 
ing one if continuous signaling is contemplated There are also some 
derived dimensions, and we shall come to them later What of the 
first order ones? This is not the place to review all the psychophysical 
data. They are the joint products of a succession of workers m the 
Virginia laboratory, and most of them have been published (Celdard, 
1957) But we must see where we stand 


First-Order Dimensions 

Loras has never been systematically mvestigated though a good 
deal is known about .1 tough the stakes of Specter and Howell 
(cf Geldard 1957) The question of how many places “ e * 

can be utilized is not settled by extrapolation of the results of two 
pent esthesiometer measurements Unlike static pressure, . ™chan Kd 
v, brat, on applied to the skm does not stay iwtlun boun* unte 
special steps are taken to prevent its spread (Betey, ^ 

means that the two point hmen for vibration is many times grea 
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than the static one lor a given region Howell found that seven 
vibrators could be spaced on the ventral rib cage with 100-pcr cent 
identifiability of locus, under Ins conditions This is probably about 
the limit for a practical cutaneous communication system The chest 
accommodates five conveniently, and it is tempting — until one finds 
that it will not work — to consider the combinations of vibrators (31 
signals for 5 contactors) that might be coded The difficulty is that 
two or more simultaneously acting vibrators feel no different from 
one, once the static pressure of each has adapted out, and provided 
the vibratory pattern is set up in all of them with the same onset. A 
split second temporal differential is all that is needed to restore two 
local impressions, but this leads to the further complication that such 
a manipulation also provides the essential conditions for synthetic 
movement, tactual "phi” This, m turn, could be coded, of course, 
and we shall encounter it again later as belonging in the family of 
"derived” phenomena. 

The failure to explore the entire body for locus as a codable cue is 
not the result of neglect, nor yet of lack of interest in the outcome 
In this electronic age so many things have to await technological 
advance We simply have not had a transducer with the right prop- 
erties to make the experiment feasible When Dr Howell had an 
array of eight vibrators on his subject’s chest, he was literally trapped 
m a forest of concrete rooted supports, flexible goosenecks, and long, 
fiat springs that were necessary, with the vibrators then in use, to 
ensure independence of vibratory generation and to preserve a uni- 
form stabc background pressure. It is, therefore, a considerable 
satisfaction to report at this time a technological "breakthrough” of 
recent weeks whereby my colleague. Dr R C Bice, has succeeded 
m modifying radically a transducer of the hearing aid variety to pro 
vide strong, low frequency vibrations which are not readily damped 
by the skin He has also found an ingenious way of utilizing fabric 
fasteners to couple the vibrator firmly to any chosen site of stimula 
bon. The instruments are sufficiently small and light and their elec- 
trical properties are such that they yield high powers without undue 
beating. For the first time the systematic exploration of the dimen 
sion of locus seems to be m sight 

What of intensity? There have to be some limits Each part of 
the integument has its own absolute threshold, and each part has its 
suscepbbihty to both discomfort and damage by the trip-hammer 
achon of powerful vibrators In the region of the chest, a useful 
range of stimulus amplitudes is bracketed by the values, 50 to 400 
microns, the former because it is safely above the 100 per-cent abso- 
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lute threshold, the latter because it falls well below the threshold of 
discomfort between these limits the average observer can, under 
laboratory conditions and with the use of a careful psychophysical 
procedure, detect about 15 mtensitive steps On an ibsolute recogni- 
tion basis, unless one were to select subjects or train them, it would 
be unsafe to include more than three steps, widely spaced o\er this 
range The mtensitive dimension is, in fact, the least exploitable of 
all the first order dimensions An analysis of errors in a communica- 
tion system that codes locus, intensity, and duration shows nearly 
all mistakes to he made along the dimension of intensity There 
appear to be two chief reasons for this ( 1 ) a fixed amplitude applied 
to different loci varies considerably in its “feel,” owing perhaps to 
accidents of local innervation, perhaps to reinforcing or damping 
variations in underlying tissue, (2) it is not easy, in the face of breath 
mg motions and those coming from circulatory events, to maintain a 
strictly invariant relation between a mechanical vibrator and the shin 
surface on which it rests 

Duration of a continuous vibratory “package" is judged with some 
precision over the entire useful range The range must, of course, 
be selected on extraneous considerations In our experiments we 
have chosen not to deal with any durations under 0 1 second, tins on 
the ground that a “buzz” much shorter than tins is likely to be mis- 
taken for a '“nudge" or a “poke" At the other end of the scales vve 
have set 2 0 sec as a limit beyond which vve ore unlikely to wish to 
code signals, a communication system employing units lasting more 
than 2 sec is certainly a ponderous one Between 0 1 sec and 2.0 sec, 
then, there is a durational continuum within which tiie average ob- 
server can make about 25 distinctions, the steps being of the order of 
0 05 sec at the low end and 0 15 sec at the high end of the range This 
is again the relatively precise At of the psychophysical experiment 
Absolute identifications with 100-per-ccnt accuracy yield four or five 
considerably more widely dispersed lc\cb, and, if neither sthction 
of subjects nor training of them is intended, it is safer to uso only 
three. 

It is clear from the foregoing that three sets of building blocks of 
cutaneous communication systems arc in our possession a limited 
number of absolutely discriminate steps of locus, intensify, and 
duration There is, of course, a fourth primary dimension. This is 
frequency The story of frequency discrimination in the vibratory 
realm is not a simple one There is a history, and also some recent 
experiments have liad interesting outcomes. The history we may 
summarize by say mg that failure to control for differences ui subjec- 
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hvc intensity, when frequencies were being complied, anti for con 
ZZIZ transients nt lhe onset end offset of the stilus envelope 
has invalidated all measuns prior to those recently obtained by 
Genevieve Golf in our laboratory (Golf, 1959) She overcame these 
defects by first assembling a band of equal loudness stimuli, differing 



Fig 1 Differential frequency discrimination of mechanical vibration applied to 
the Engcr bp (Goff, 1959) The curves are for a single subject, 

in frequency, and (hen measuring Af systematically throughout the 
obtainable frequency range Her basic results are shown in Fig 1 
It is clear that, at very low frequencies, judgments of vibratory 
"rate” are quite good, but it js equally clear that discnmwabdity 
fades rapidly as the frequency scale is ascended. In the region best 
for speech sounds the skin does very badly indeed, and this finally 
explains why the “hearing through the skin” programs, alluded to 
earlier, yielded such disappointing results A. direct comparison be- 
tween the skin and the ear is made possible in Fig 2, the auditory 
data are the classical curves obtained in the Bell Telephone Labora 
tory (Shower and Biddulpb, 1931) at a roughly comparable intensity 
level. 

There is one additional difficulty where vibratory frequency js con 
cemed, were it not for this, it might conceivably be possible to trans 
pose audible frequencies downward into the tactile range The 
difficulty is this-~the correspondence between vibratory frequency 
and perceived “pitch” is a tenuous and uncertain one Vibratory 
pitch proves to be a joint function of both frequency and amplitude 
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To be sure, this is also the case, for much of the audible range at least 
,in /leanns as well, but intensity is only a very minor determinant of 
auditory pitch, making pure tones louder or softer can move pitch 
about only a little Frequency is very nearly m absolute control In 
toe cutaneous sphere things are different Increase the amplitude 
of a moderately loud 40 cycle sinusoidal vibration applied to the 



F, a 2 DiscnminabiLty of vibratory frequency (A/) compared with the auditory 
pitch discrimination data of Shower and Biddufph, 1931 (after Goff, 1959) 

fingex tip, and it undergoes a marked downward shift in pitch De 
crease the amplitude, and its rate goes up perceptibly B£k£sy has 
recorded shifts of the order of three octaves (B6k6sy, 1959) It is 
obvious that frequency would have to be handled gingerly in a com- 
munication system, especially if intensity were simultaneously mampu 
Iated as a variable 


A Workable Vlhratory System 

In our jminl efforts to form a system of vibratory signals capable 
of being coded, only the three obviously useful dimensions of locus, 
intensity, and duration were emploj ed The code shown in Fig 3 
was devised and applied successfully by Howell, so successfully that 
a subject who li vd invested a total of 30 hours m learning the alphabet 
of the "vibratcse" language could, after a further training period of 
only 35 hours, receive sentences with 90-pcr-cent accuracy when 
transmitted at the rate of 3S five letter words per minute (Howell, 
1956) 

Tins performance in no wise represents the optimal att unable 
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Using the same rules as those of international Morse code mil. re- 
spect to interword and intraword spacing (01 sec and 00a sec, 
respectively), the system requires only 079 sec to transmit the 
"average" Eve letter English word This means that the ceding 
transmission rate is 67 words per minute, a speed well over three 
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tunes that o£ proficient Morse To approach this rate, however, there 
would be required an extensive and somewhat elaborate engineering 
job to devise an automatic coder — perhaps of the tape variety — that 
would initiate signals faster than our present homemade machine, a 
manually operated typewriter that triggers “flip-flop” circuits (for 
time) and a bank of potentiometers (for intensity). The closing of 
the gap between 38 and 67 words per minute we leave to those inter- 
ested in establishing world’s records 


Future Potentialities 

Meanwhile what other building blocks are available 9 We have 
considered only the first-order dimensions of the vibratory stimulus 
There are some derived dimensions, a few of which have already 
received some attention. It would be helpful to have oddibonal 
codable cues, if only to be in a position to add redundant elements 
Much has been written about purifying and simplifying languages by 
reduction of redundancy Where intelligibility is Iks than optimal 
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there is much to be said for making the language more, rather than 
less, redundant (Miller, 1951, p 103), and the vibratese language 
would doubtless benefit from this Lind of doctoring 
What are the candidates here? Intensity variations as a function 
of time present one set of possibilities A signal may be imposed on 
the skin quite abruptly or more gradually, just as m music one may 
have variations in “attack, “hitting” a tone or ‘sliding into” it Sys 



temabc manipulation of tbs variable has been earned out by Howell 
Figure 4 serves to describe both how the envelope shapes were made 
to w and how discnmmabihty of rate of onset changed watt, dc 
* i ittarJc For a growth of amplitude from zero to 
E - ™ practicable to w oik with-and 
f f “ ranacd from the shortest that the system could 

for nso tun“ t the longest that might serve for sig- 

produce be sue derernmnable steps As with 

STS SSS these come from the content.™ of psy- 
att orner simian ldenUflc ahons are, of course, not spaced tills 

rloif]y S1 Indeed if 100-per-cent recogmzability of 'attack rate is 
closely M • c of onsct ,| 0I , CS them are but two the 

"i number' pro- to be 7 minus S! There arc presumably 

rrzs&szttx *-* — - - 
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torn) This lias not been investigated yet, though it is clear that 
wave form variations should be discnminablc if the basic frequency 
is low enough Still other moment-to moment variations should be 
detectable Perhaps it would be possible to introduce variations 
within signal envelopes other than those of onset and offset, a gradu- 
ally increasing or decreasing frequency, say This is an unexplored 
field. 

Another patterning of stimuli involves space as well as tune As 
m vision, the successive stimulation of two separated receptive areas, 
provided the temporal relations are right, leads to perceived move- 
ment Unlike vision, the critical time relation for tactual "phi” is the 
absolute interval between the beginnings of the two successive ex- 
posures, not the duration of the silent interval betw cen them Indeed, 
there need be no silent interval at all The exposures may overlap 
and still yield good movement if the onsets are properly spaced. 
Cutaneous phi has been surveyed, not so much with a view to coding 
it and incorporating it into a communication system, as to studying 
its essential conditions as a phenomenon, but clearly it offers some 
possibilities 

It is doubtful whether vibratory phi should be coded into any lan- 
guage of the type described above Indeed, m any rapid succession 
of signals applied to spatially discrete loci, the crucial conditions for 
synthetic movement are already given Movement has to be over- 
looked, not apprehended in this situation, of course On the other 
hand, the phenomenon of cutaneous movement is an extraordinarily 
powerful one, it really commands attention Place a ring of vibrators 
around the body, for example, three across the front and three across 
the back of the thorax, energize them successively with a 0 I sec tem- 
poral separation, and the observer feels a vivid "swirling" motion, 
entirely novel in his experience, because he seems to be at the center 
of ltl This effect is a completely prepossessing one and, accordingly, 
it is ideal for coding m a different fashion, say, as a warning signal 
to be infrequently used but capable of highly significant coding It 
would do valiant service attached to a “panic button ” 

This raises the important question of the ultimate place of the cuta 
neous channels in the total communication picture Coding to letters 
and numerals is really a quite pedestrian way of getting meanings 
into tactile patterns There are, to be sure, obvious ways of making 
such a system “fly” at a faster rate One way would be to code the 
vibratory signals to phonemes We have not attempted it because of 
the prodigious investment entailed in learning the phonemes them 
selves, but it ought to be tried It is also possible that there may 
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come out of hiding an entirely novel cutaneous shorthand, one capi- 
talizing on distinctively tactile properties Senous study of basic 
cutaneous perceptual phenomena, an area dignified by the devotion 
of not more than a dozen first rate minds m the whole of recorded 
history, might turn up such a linguistic development 


Vibratory Tracking 

Tile possibilities of cutaneous communication are by no means con 
fined to conventional language, of course Other lands of information 
may be imparted tactdely Rates, amounts, directions — anything 
falling on umdimensional or bidimensiona 1 contmua could presumably 
be represented to the shin by means of suitably patterned mechanical 
impacts or sequences of them One of these possibilities has already 
been exploited in our experiments Vibratory tracking by compensa 
tory pursuit has been carried out by lining up tliree vibrators across 
the chest and letting them be successively energized to give the 
impression of continuous movement in one direction or the other 
( through utilization of "phi”) The "arrowhead” was always ' pointed” 
toward the target, and the vibratory sequences were temporally 
spaced to indicate degrees of urgency in getting back “on target ” 
The subjects manipulated a steering wheel and attempted to elimi 
nate all cutaneous signals by promptly neutralizing all off target 
indications 

Ten subjects performed in response to these tactile signals Ten 
others were presented the visual analogue, with lights substituted 
directly for the vibrators Both groups learned rapidly, and the vibra 
tory performance was in no wise inferior to the visual. Comparative 
learning curves are shown m Fig 5 Although the visual conditions 
are not optimal for this sense-the target was "traveling” it the rate 
of only 3 5 degrees per second, and die eye of course, can handle 
speeds many times as great— the tracking task imposed on the subjects 
was one that would keep all but the speediest vehicles comfortably 
on course, and the skin was handling the assignment fully as well 
as the eye _ , 

Subsequently, this experiment underwent simplification Synthetic 
movement was talen out of the display, the three vibrators were 
reduced totwo and even the "urgency* feature was eliminated iNow 
there occurred only a simple "nod go,' a brief burst of 60-cjcJe vibra 
lion to indicate direction off the target to the right or left. Per- 
formance showed no significant degeneration with tins removal of 
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redundancy in the signal Very little is needed to give directional 
information that is adequate to fairly complex performance 
Currently there is being tested a bidimcnsional vibratory display 
designed to give both "right left” and “up down” deviations from the 
target (glide path? trajectory?), and there is every indication that the 
system wiU do what is expected of it The value for situations in 
which vision and hearing are pro empted, for one reason or another, 
is obvious 



Fig 5 Comparison of visual and vibratory tracking performances The error 
scores were weighted for distance off target (Bice, 1953) 

Another whole domain of possibilities opens up when we consider 
that only one form of energy, the mechanical, has thus far entered 
our calculations Not much is to be hoped for from the chemical 
and thermal forms of stimulation They are both too ponderous in 
their operation to be of much use m communication, at best they 
could provide only the analogues of smoke signals But there is the 
whole important realm of electrical stimulation The skin responds 
with lively patterns to both direct and alternating current Indeed 
the heart of the problem is that the patterns are, in general, somewhat 
too lively for comfort We have devoted several years of intensive 
effort to Ending the conditions of electrical stimulation that will yield 
codable vibratory patterns bereft of pain, and that can be reproduced 
on demand The problem has turned out to be a slippery one Elec- 
tricity is the great “nonadequate” stimulus, it triggers everything, 
as physiologists well know However, the important sbmulus param 
Cters are few in number, and my colleague. Dr John F Hahn, has 
been able to isolate the really significant one in skin stimulation 
(Hahn, 1958) As Fig 6 taken from his work shows, where square 
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waves are employed and are systematically varied in frequency and 
duration, thus obviating any influence of the change in rate of current 
increase such as occurs with alternating current of variable frequency 
it turns out that absolute threshold is related to duration only. This* 



Duration (msec) 

Fig. 6 . Strength duration curves at several rates of repetition of direct square- 
wave stimulation of the skin. Absolute threshold /base line is the ratio of absolute 
threshold to the mean of thresholds with pulse durations of 20 msec or longer, 
pps is pulses per second {Hahn, 1958), 

fundamental discovery permits us to narrow our search for the basic 
medium of reception m electrical sensitivity of the skin. The great 
stumbling block, thus far, is the omnipresent pain. Although under 
some reproducible conditions pain tends to adapt out in continuous 
signaling, leaving behind a not too unpleasant tingle, it is doubtless 
asking too much, in practical communication situations, to expect that 
even transient discomfort from a transducer will be tolerated. 
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Summary 

\ really comprehensive approach to communication problems en- 
t.nil s not only the domains implied by the terms, “stimulus, physio- 
logical changes,” and “response,” and the three sets of relationships 
into which they enter, but the much more remotely related terms, 
“physical objects” at the beginning of the senes, and “conduct” at the 
end of it The position one takes on these relations has much to do 
with how questions concerning communication are to be asked 
There axe reviewed the currently available facts concerning cuta 
neous sensibilities, insofar as they have bearing on the communication 
problem. The possible elements of a cutaneous language are iden 
Ufied as those collocations of stimulus properties that are recognizable 
in the absolute The first-order dimensions of mechanical vibration 
(locus, intensity, duration, frequency) provide the major possibilities, 
though interactions between frequency and intensity are such as to 
render one or the other of these difficult of utilization. \ few steps 
of locus, intensity, and duration provide enough codable signals to 
make rapid and accurate communication possible 

In addition to the primary dimensions, there are some secondary 
or denied ones Intensity variations as a function of tune (“attack”) 
constitute one such possibility A second dimension is the derivative 
of frequency, w av e form. A third involv es both spatial and temporal 
variations, synthetic cutaneous movement. The first and third have 
already received some attention, the second has not been much mves 
fa gated as jet. Movement (cutaneous “phi”) is especially useful in 
unusual warning situations because of its vividness It ha<; also served 
as an mtensifier of signals in tracking situations. 

\ second contender for prominence in future cutaneous comm uni ca 
lion systems is electrical stimulation. The possible stimulus par am 
eters are few m number, and recent work bac shown pulse duration 
to be the important one in sensitivity measurements Direct electrical 
stimulation of the slan minimizes the transducer problem and is ultt 
mately to be preferred over mechanical systems if the ubiquitous 
problem of pain can be circumvented. 
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Summary 

A really comprehensive approach to communication problems en- 
tails not only the domains implied by the terms, “stimulus,” “physio- 
logical changes,” and “response,” and the three sets of relationships 
into which they enter, but the much more remotely related terms, 
“physical objects” at the beginning of the series, and “conduct” at the 
end of it The position one takes on these relations has much to do 
with how questions concerning communication are to be asked 
There are reviewed the currently available facts concerning cuta- 
neous sensibilities, insofar as they have bearing on the communication 
problem The possible elements of a cutaneous language are iden- 
tified as those collocations of stimulus properties that are recognizable 
in the absolute The first-order dimensions of mechanical vibration 
(locus, intensity, duration, frequency) provide the major possibilities, 
though interactions between frequency and intensity are such as to 
render one or the other of these difficult of utilization A few steps 
of locus, intensity, and duration provide enough codable signals to 
make rapid and accurate communication possible 

In addition to the primary dimensions, there are some secondary 
or derived ones Intensity variations as a function of time ("attack”) 
constitute one such possibility A second dimension is the derivative 
of frequency, wave form A thud mvolves both spatial and temporal 
variations, synthetic cutaneous movement. The first and third have 
already received some attention, the second has not been much inves- 
tigated as yet Movement (cutaneous “phi”) is especially useful in 
unusual warning situations because of its vividness It has also served 
as an mtensifier of signals m tracking situations 
A second contender for prominence m future cutaneous communica- 
tion systems is electrical stimulation The possible stimulus param- 
eters are few m number, and recent work has shown pulse duration 
to be the important one in sensitivity measurements Duect electrical 
stimulation of the skin minimizes the transducer problem and is ulti- 
mately to be preferred over mechanical systems if the ubiquitous 
problem of pain can be circumvented 
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Selected Developments 
in Psychophysics, with Implications 
for Sensory Organization 


The purpose of this chapter is to outline some selected recent devcl 
opments in psychophysics that may be relevant to problems of sensory 
organization In this discussion it is assumed that psychophysics is 
concerned primarily with “terminal” activities, in that specified sensory 
inputs are related to specified behavioral outputs, without direct ob 
servahon of the dazzling complexity of intermediate neural events 
Psychophysical data are thus necessary, but not sufficient, for a com 
plete model of sensory organization From this point of view, psycho- 
physics will be considered to suggest crude alternative models, or 
broad strategies of action, for approaches to the sensory organization 
of the human observer, rather than crucial details All references to 
the nervous system in the chapter refer to a conceptual nervous system 
that serves between the terminal activities of stimulus and response. 


Psychophysics and Information Measurement 

The outstanding impact of information theory— more accurately, of 
information measurement— upon psychophysics has been the specifi 
cation of the stimulus (Quastler, 1956) Instead of the specification 
of a particular energy configuration, information measurement has 
called for the specification of the entire ensemble of possible events 
to which the organism might be exposed in a particular situation 
Although minor contextual effects ha\e long been known to psycho- 
physics the concept of the ensemble of possible stimuli far transcends 
second-order effects In the light of information measurement, the 
entire ensemble of events becomes, in effect, a superordinate stimulus 
S9 
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With the same impetus, the ps>cho physical behavior of interest 
becomes the stimulus response confusion matrix which relates each 
of the possible responses to each of the possible stimuli (Miller and 
Nicely, 1955) The stimulus response confusion matrix m turn is 
cru cia l m testing alternative models of sensory organization These 
models consider the representational mapping of stimulus objects on 
the relevant portion of the conceptual nervous system This relevant 
portion will be termed the discrimination sjiacc In particular, two 
extreme alternative models will be further considered in detail 

The fixed-map mode! 

The fixed map model of sensory organization implies a complete 
lack of interaction among the alternative stimulus objects of the en 
semble (Luce, 1959) The model implies that the mapping of any 
specific stimulus object onto the conceptual discrimination space is 
invariant with the other members of the stimulus ensemble That is 
to say, corresponding to each stimulus object there is a location in the 
discrimination space, and the location associated with a particular 
stimulus object does not depend upon the other members of the 
ensemble 

In terms of psjchophysical results, the model suggests that any 
stimulus of the ensemble is effective only insofar as the stimulus con 
tributes responses to the total pool of responses Removal of the 
stimulus and redistribution of its corresponding responses within the 
re mainin g stimulus response confusion matrix leave the ratio among 
specific stimulus-response confusions unchanged Hence, the pre 
dicbve instrument for the fixed map model has been termed the 
constant ratio rule (Clarke, 1957) It should be noted that the model 
makes no assumption of the distribution, or of the scalar properties, 
of the objects of the stimulus ensemble 

Rubber-band model 

\n altemahv e model of sensory organization assumes a high degree 
of interaction among the members of the stimulus ensemble It 
assumes a variable mapping of the stimulus ensemble onto the appro 
pnate discrimination space, depending upon the available range of 
stimulus objects m the ensemble The discriminative density, in turn, 
is inversely proportional to the range The change m density with 
range may be illustrated by marking off equal distances on a rubber 
band, and then counting the number of marks as the band is stretched 
and contracted, with reference to a fixed length. From this illustra 
tion, the model gets its name the rubber band model of sensory 
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organization (McGill, 1958) In terms of the model, any specific pair 
of items may be highly discnmmable if the entire range under presen 
tation is narrow, but poorl> discriminated if the entire range is wide 
The rubber band model, however, dots not jicld exact quantitative 
prediction unless we assume that the discrimination space is alwa>s 
stretched to accommodate tlie entire range of the stimulus ensemble 
Tins assumption is unreasonable, for it would result in the prediction 
that the discnmmability between any two stimuli, of an ensemble 
of only two stimuli, would bo independent of the differences between 
the stimuli Tins is certainly not the case Further, whereas the 
rubber band model may reasonably be applied to stimulus ensem 
blcs that may bo arranged along a single physical continuum, it is 
not evident how the rubber band model would accommodate stimulus 
ensembles that vary along several physical con tin ua. These points 
suggest that predictions based on the rubber band model might be 
successful on!> for stimulus ensembles that vary with respect to a 
single physical variable, or a small number of variables 

Confusion-matrix results 

Several empirical studies have uow shown that, for stimulus en 
semblcs of a high order of physical complexity (for example, words 
presented in noise, letters presented in a bnef flash), the constant 
ratio rule is an excellent predictive instrument, even for relatively 
small confusion matrices ( Clarke, 1957, Clarke and Anderson, 1957) 
Said otherwise, with these stimulus ensembles the fixed map model 
appears to lead to precise prediction of psychophysical data. 

Stimulus ensembles that vary with respect to only one variable do 
not yield clear evidence Preliminary data (Clarke, 1959) suggest 
that large deviations from the constant ratio rule (in the direction pre 
dieted by tlie rubber band model) are observed for stimulus ensembles 
in which only a single variable is manipulated However, these devia 
tions are observed' only with small’ concision nnrfrnasr umf umV * mlvir 
the stimulus objects of the ensemble are highly discnmmable For 
larger confusion matrices, or for small confusion matnees in which 
the stimulus objects of the ensemble are highly confusable the 
constant ratio nile is an excellent predictor of the psychophysical 
findings, even for stimulus ensembles that vary with respect to a single 
variable (Hodge and Pollack, m preparation) 

Thus the concept of the stimulus ensemble, denved from informa- 
tion measurement, with the associated stimulus response confusion 
matrix permits examination of alternative models of sensory orgamza 
hon The weight of evidence favors a model that assumes dial stimu 



94 IRWIN POtlACK 

background On the other 50 per cent, there is no additional flash 
The detection system consists of a photocell sensor and a response 
threshold detector, such that if the output of the photocell sensor 
reaches a critical response level, the system responds "flash , and, if 
the sensor fads to reach the critical response level, the system re- 
sponds “no flash.” Let us assume that, on successive occasions, the 
response threshold detector may be set to different critical response 
levels 

If the critical response level of the threshold detector is set extremely 
low, the photocell system "sees” nearly all the flashes, that is, it regis- 
ters the presence of the flash upon nearly all the trials in which the 
flash has been presented In the language of signal detection, a high 
“hit” probabdity has been achieved The price that must be paid for 
setting the critical response level low is that the photocell system 
may register often the presence of a flash on trials in which the flash 
was not presented. In the language of signal detection, a high "false 
alarm” probabdity would be obtained Thus, when the critical re 
sponse level is low, a high hit probabdity is associated with a high 
false alarm rate 

Similarly, if the critical response level of the threshold detector is 
set high, the system may fad to report many of the flashes on trials 
m which the flash was, indeed, presented And, in the same manner, 
the system would rarely register the presence of a flash on tnals in 
which the flash was not presented. That is, when the critical response 
level is high, a low hit probabdity is associated with a low false-alarm 
rate 

Note that, with the same photocell sensor, a wide range of hit 
probabilities, with an associated wide range of false alarm probabih 
hes, may be obtained simply by varying the critical response level 
of the threshold response detector operating upon the information 
furnished by a sensor of the same photocell system Should we now 
conclude that under these conditions the photocell sensor is changing 
m its sensitivity or discrimination of the flash above the background 
illumination? No, it is not reasonable to infer changes in the behavior 
of the photocell sensor, for its characteristics are determined by its 
owrn construction relative to the sensing environment under examina 
bon. It is rather more reasonable to assume that the discnminabdity 
of the photocell sensor is constant and that the operating level of the 
decision part of the system is changing. This distinction between the 
discnminability of the sensory part of the system and the operating 
level of the decision components of the system, acting upon the sen 
sory information, is cruciaL 
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What his been said for the electronic photocell system can also be 
said for the human discriminator Under a given set of viewing con 
ditions of observation, a wide range of operating levels may be ob 
tamed by varying the instructions to the observer or by varying the 
relative values and costs of missed detections and false alarms In 
fact, it is possible to show that, under a specific signal and noise condi 
tion, the discnminabihty of the human observer is invariant over a 
wide range of operating levels (Egan, Schulman, and Greenberg, 1959, 
Pollack and Decker, 1958, Tanner and Svvets, 1954) 

What are the implications of the signal detection approach for 
sensory organization? The primary implication is that, whereas a 
response threshold detector is reasonable, a sensory threshold detector 
is not tenable The experimental evidence indicates strongly that the 
sensory information about the environment is preserved in a form 
such that a continuous range of information is available for subse 
quent decision making 

The negation of the sensory threshold is of course, a radical notion — 
so radical as to invite immediate suspicion In the comment, "Is There 
a Quantal Tlireshold?” (in this volume), Stevens takes an opposite 
position He points out that, in every experiment carried out within 
the framework of signal detection, noise has been introduced with 
the signals to the observers He notes that the introduction of noise 
is alone sufficient to mask the "sensory neural quantum" (Miller, 
1947) Experiments with pure tones (Stevens Morgan, and Volk 
mann, 1941) may, or may not, yield the sensory quantum Further, 
it is argued that the forced choice procedure of the signal detection 
approach places an unnecessary burden on the observer which tends 
to mask the sensory quantum Much heat can be generated by argu 
ing these points At present, however, it seems more profitable to 
determine whether we can set up an experimental test between the 
statistical detection and the sensory quantum approaches The fol 
lowing set of expenmen ts might provide such a test if a number of 
assumptions can be met 

First we recreate the conditions of the initial Stevens, Morgan, and 
Volkmann experiment, in which the "neural quantum” was observed 
We assume that we are successful in obtaining a fortunate selection 


of observers capable of maintaining an unwavering attention over 
extended periods of tune” so that quantal functions can be obtained 
We also assume that we can get the listener to make a confidence 
rating of each of bis responses without disturbing the quantal func 
bon (For a discussion of the use of confidence ratings see Egan, 
Schulman, and Greenberg 1959, Pollack and Decker, 1958 ) And. 
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finally, we assume that we can interpose a small number of "catch 
trials," in which the signal is not presented, without disturbing the 
quantal functions of the observer 

If the neural model is represented by a discriminative conUn uuui, 
the listener’s confidence ratings should reflect his accuracy of judg- 
ment on the critical trials On the other hand, the sensory-quantum 
approach predicts that the listener’s confidence ratings will not reflect 
any difference, owing to the quantal nature of the changes In this 
regard, it may be noted, the original article reported that large in- 
tensity increments often “sounded louder” than small ones (Stevens, 
Morgan, and Volkmann, 1941, p 334). 

Of course, the experiment cited above may not be feasible As 
Stevens and others (Miller and Gamer, 1944) have noted, any dis- 
turbance of the observer often results m the fadure to obtain the 
neural quantum. Adding the confidence response may vitiate the 
entire experiment Still the approach may warrant a senous attempt 
The theory of signal detection has another— and potentially more 
important-implication for sensory organization The theory defines 
an i deal observer. The ideal observer is a performance model that 
is able to extract all available information from the signal and noise 
environments (Tanner and Birdsall, 1958) The ideal observer sets 
an upper limit on the discnmmability of the observer under examina- 
tion— whether human or electronic. 

de “°“ tote ^ *= human observer performs poorly 
rekhve to the ideal observer m the detection of weak signals m noise. 
Here the ideal observer can use phase information and other classes 
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able to make room for such flexibility in considering the molar prop- 
erties of the sensory organization of the human observer. 


Summary 

Two recent contributions to psychophysics-information theory 
and signal-detection theory— have been briefly considered insofar 
as they may point toward molar properties of sensory organization 
of the active human observer. Psychophysical data, which reflect 
only stimulus-response correlations of the molar observer, by their 
very nature cannot be expected to reveal the details of the actual 
neural substratum of sensory organization. 
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COHN CHeHRY 

Imperial Collage, Umveriiljr of london 


Two Ears — but One World 


Tho Formation of Binaural Gestalten 

I bclie\e it was Epictetus the Stoic who was reported as saying, 

God gave man two ears, but only one mouth, that he might hear 
twice as much as he speaks ” 

This wishful thought may be empty, so far as human wisdom is 
concerned, but it is curiously near truth on the plane of psychophysics 
For the possession of two ears gives us greatly enhanced powers of 
aural discrimination we can the better separate a single voice in a 
bu2z of conversation and attend to it, or we can single out a voice 
from the sounds of traffic, or of the wind, and all the myriad of dis- 
turbing noises The brain takes maximum advantage of the slight 
differences between the signals that reach the two ears — differences 
in tuning, in intensity, and m microstrvcture—and by processes of 
inductive inference breaks down the complex of sounds into separate 
coherent images, Gestalten, which becomes projected to form the 
subjective "spatial world” of sound 

The basic fact about any one of these images, say that of a voice, 
is that it is single, with two ears we hear only one world By what 
logical processes does the brain examine and analyze the sense data 
reaching the two ears, so as to achieve this fusion? Understanding 
of the fusion process seems to be fundamental to an understanding 
of directional hearing Understanding of the logic of the processes 
whereby voices in a crowd can be separated is a parallel problem, 
for we can separate voices, and other Gestalten, with one ear alone 
but with reduced effectiveness 

In this chapter we shall be concerned only with this basic phenome 
non of fusion, discussing it as a psychological problem and its de 
scnption as a logical (mathematical) problem Physiology will not 
be mentioned, for we shall not look inside the held at all On the 
contrary, we shall try to set up a model to describe what "must” he 
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there Before you physiologists charge me with arrogant presump 
toon m saying this, I should ask you to remember two things First, 
we are not saying what physiological mechanism must be there, but 
only by what logical principles it can be described as carrying out its 
functions (for example, whether it adds or multiplies or integrates), 
second, if you disagree with any aspects of our mathematical model, 
it is up to you to show that the logical operations are physiologically 
impossible If you succeed, we on our side will retract and re examine 
our logic (or, more likely, our premises) 

The model of binaural fusion, to which I refer here, has been de- 
scribed elsewhere in complete detail (Sayers and Cherry, 1937) It 
is largely the result of the extensive experimental and computational 
work of Sayers while he was a student in my group at Imperial 
College, London* 


To Fuse, or Not to Fuse? 

In order to study fusion, as opposed to directional hearing (Cherry 
and Sayers, 1950), we eliminate the effects of head turning by fitting 
the subject with headphones If the two earphones are driven with 
identical signals of any kind, the subject hears a single fused image 
located in the center of his head If the interaural time difference 
T t , or the relative intensity Al/A r , of the left and right hand signals 
is varied, the image appears to move across the head laterally m a 
line between the two ears The image does not appear to pass out 
side the head and has no angular direction, as in real life, it has only 
lateral position, left to right 

We control the lateralization by varying the interaural time interval 
T, only, driving each earphone from a separate reproducing head 
on a special magnetic tape recorder, running at high speed The 
interaural time interval T, can be set to any value to an accuracy of 
less than 20 microseconds 

The interval T, is set to a succession of random values (using a 
table of random numbers), and the subject is required to guess 
whether the fused image appears to he to the right or left— a dichoto 
mous judgment, R or L The forms of such lateralization “judgment 
curves’* fall broadly in to two classes (1) If the source is predomi- 
nantly random, or quasi random, the form is like that shown in Fig la 
When T, is large (say 1 to 5 milliseconds), the subject answers 100 
per cent correctly, when small, he makes errors, when zero, he is (by 
chance) correct about 50 per cent of the time (2) If the source is 
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real life sources of sound contain both (quasi ) periodic and stochastic 
components , 

We shall concentrate first on sources of type 1— for example, speech 
or noise 

This downward dip in Fig la represents the subject’s uncertainty, 
which has at least two origins (1) since the fused image has a 
finite subjective size, its central point is uncertain, and (2) the subject 
is uncertain of his own midplane (intracranial) about which he is 
mahing acoustic judgments 

The arbitrariness between zero values of T* on the magnetic tape 
and the central (intracranial) plane is removed by replotting Fig la 
in terms of percentage of right hand judgments, as m Fig lc, thereby 
skewing the curve This method is adopted hereafter for all forms 
of “judgment curve ” The fact that T e = 0 now is not to be taken 
as “an applied central image” Note that Fig lb is plotted this way 
also 

So much, briefly, for the case of identical aural stimuli If, how 
ever, the left and nght signals, S t (f) and S JC (t), are utterly different 
(statistically independent), say, two different voices, or different 
noises, then no fusion takes place (Cherry, 1953) What we are here 
regarding as a “fusion mechanism” is not triggered This mechanism 
is then operated, or not operated, as determined by the relation 
between S t(t) and S B (f) 

It has been our purpose to examine experimentally the nature of 
this critical relation and to find a model for the analytical proc 
esses whereby the brain determines it, as a control for the “fusion 
mechanism ” 

Obviously it is far more interesting to see what happens when S t (f) 
and S B (f) axe neither identical nor totally unlike, but only partially 
alike In what sense is “alike” to be defined? What does the brain 
look for m its running analysis? 

If we were to confine ourselves to the use of simple determinate 
signals, like sine waves, clicks, and so forth, we might oversimplify 
the problem For m real life the aural stimuli, such as human speech, 
street noises, and so on, are stochastic “Stochastic” means "proba- 
bilistic”, the brain cannot know in advance exactly what is coming 
to the cars next, in microscopic detail instant by instant Further 
than this the two signals S,(t) and S n (t) are never exactly alike, in 
detail, in real life They differ not only in timing T e and average in 
tensity At./ A a owing to the spatial directions of sound sources, but 
m other wa>s owing to head sound shadows, reflections and rever 
be rations, and random sound contributions from the wind on our faces 
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Again, m real life, an aural image stays fused and does not hop 
about as the two aural signals fluctuate It must be average statis 
heal, invariant properties that control this “fusion mechanism, and 
several different storages may be used, for different averaging times 
Mathematically speaking the measures that assess the average 
(statistical) degree of dependence, or independence, of stochastic 
sources are the correlation functions There are different forms of 
such measures but certain other real life conditions under which 
the brain must operate, narrow down the possibilities Such func 
tions have found a considerable place in models of other aural phe- 
nomena, notably in the work of Lichhder (1951 1956), especially in 
connection with pitch perception and spahal separation but not, so 
far as we can find, in models of bmaunl fusion prior to our 1956 
paper (Cherry and Sayers, 1956) We shall be returning to such 
theoretical aspects in a later section 


Experimental Control of the Statistical Independence 
of the Two-Ear Signals 

Before theorizing further, let us refer to some experiments Figure 2 
shows the same binaural stimulus arrangement as resulted in the 
curves of Fig 1, except that now band limited white noise is added 
to the signal at one ear only The magnitude of this noise N(t) in 
relation to that of S(t) controls the statistical independence of the 
two ear stimuli to any degree we wish 



Figure 3a shows a typical judgment curve (percentage of left judg 
meots) as T. is set to successive random values when S(t) is male 
speech Tins curve differs from that of Fig 1 in two vva,s (1) Dips 
appear, showing less oertamty of left htcral.zat.on of the image at 
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certain values of time T r . Fusion is now only partial over various 
regions of interaural delay T,. (2) A total left-right dissymmetry 

shows total fusion when the pure speech signal leads in time but partial 



(«> 



/Jg. 3. The binaural fusion “precedence effect - *, (a) typical judgment curve 
uung male speech masked by noise In left ear only {S,(t) pure speech, S*(<) 
— noisy speech], (b) typical judgment curve, using noise for left ear and noisy 
noise for right ear (theoretical curve dashed). 


fusion when the noisy speech leads. This we have referred to as a 
type of precedence effect (Cherry and Sayers, 1958). 

This new “precedence effect" seems to us to have important impli- 
cations, and it will be referred to again in the next section. 

My colleague Dr Sayers has made extensive measurements of sud» 
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binaural “judgment curves," each the result of one to two thousand 
separate right left judgments with randomly set tune delays Partly 
from such curves the nature of tire analysis performed by the brain 
(in the “fusion mechanism”) has been exposed This analysis is re 
qrnred, in part, to determine the degree of statistical independence 
of the signals that arrive at each ear in real life, since this assessed 
degree of independence is a measure of the likelihood that these two 
signals come from a single external source of sound If this likelihood 
is higher than some threshold (not necessarily constant), fusion is 
enforced, if lower, separate images are set up at each side of the head 
If the logic of such argument is accepted, we are led to the con 
elusion that the fusion mechanism” can be described as carrying out 
some type of correlation analysis, though there can be many forms 
of sucb measures To determine the precise form used by the "fusion 
mechanism,” further experiments were performed using not only 
stochastic signals like speech and noise but determinate ones such 
as sinusoids We shall return to these experiments later, m the section 
on Binaural Fusion of Speech. But first let us return for a moment 
to the curious ‘precedence effect,” already noted. 


The “Precedence” Assumed fay a Pure Sound Source 
at One Ear over a Noisy or Distorted Source at the Other Ear 


There are other “precedence effects" in hearing, but the one referred 
to earlier and illustrated in Fig 3 has not beeD reported, so far as we 
are aware, prior to our 1956 and, m more detail, our 1957 papers 
(Cherry and Sayers, 1956, Sayers and Cherry, 1957) 

The effect was mentioned only in passing in these papers, and the 
writer would like now to take the opportunity of calling the attention 
of psychologists and neurophysiologists to it, since there may be 
important implications 

With reference to Fig 3 the effect is this if the pure signal S B (t) 
(say, speech) is set earlier in time than the noisy signal Sa(<) + N(f), 
then total fusion takes place, with zero uncertainty concerning its 
right left localization, over a vv.de range of signal to none ratios, pro- 
vided the delay T. is greater than some residual amount (about ICO 
microseconds ... practice), but if the noisy signal is earlier in time, 
fusion is quite varied and uncertain This total dusymmetry is illus 
trated by the judgment curve of Fig. 3u 

In our earlier publications (1956, 1957) no explanation o this effect 
was offered It was examined over a wide range of conditions, how- 
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ever, with not only white noise masking m one ear but also various 
distortions (for example, severe amplitude clipping) 

What distinguishes a “pure” signal from a noisy or distorted one, 
thereby giving precedence to the former? Our first reaction was to 
assume that, in the case of speech, the listeners grcit experience of 
speech gives him prior information (and thus a set toward pure 
speech) But experiments soon showed the effect to exist, equally 
strongly, with sources other than speech (such as combinations of 
sine waves of different frequency, and many others), of which we 
humans have far less pnor knowledge Finally, we can destroy all 
effects of pnor knowledge by using, as the binaural source S(f), 
another wide-band white noise source, independent of the added 
noise source N(t)* 

Figure 3b shows a typical curve, which is quite asymmetrical about 
r,~0 Notice, in this case, that the “noisy noise” S (t) + N(t) has 
taken precedence over the “pure noise” S(f) 

Clearly, then, the inference process by which the brain (fusion 
mechanism) discriminates between the two signals does not neces 
sarily require pnor probabilities (though with speech or other com 
mon signals such pnor knowledge may conceivably enhance the 
effect) Recently H B Voelcker (of our laboratory) has given a 
complete theoretical explanation of the effect, unfortunately, at the 
time of writing, this has not appeared in pnnt, so no reference can 
be given. Without anticipating its appearance, we might merely point 
out that the signal S*(i) is “contained within” the noisy or distorted 
signal S L (t) +N(t), but that the converse is not true Voelcker has 
examined the nature of the relation "contained within” mathemati- 
cally, concluding that this precedence effect requires only a simple 
correlation analysis mechanism, which can be of the type that we have 
described as the basis of binaural fusion (Sayers and Cherry, 1957), 
and which is outlined m the section on The Model of the Binaural 
Fusion Process in the present paper In other words he observes that 
the processes adopted in our model are all that are strictly needed to 
explain the effect 


Binaural Fusion of Speech 

A great deal of auditory analysis is earned out with simple determi- 
nate signals, file clicks and pure sine waves Speech, which is the 

- u P<3ilap * w ' e ® vc a psychological definition to white noise, as being 

, U£Iul ^ ''hich the subject can has c no pnor microscopic knowledge (but 
Wily statistical knowledge), which presents him from making prediction* for a 
Umc ahead greater than the Nyquut Sampling Time.” 
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class of signal of greatest importance in real life, has a very complex 
structure, but this does not preclude its use for such experiments as 
we are considering here On the contrary, we regard as essential the 
use of speech and other stochastic signals, when attempting to build 
models of aural mechanisms — which, after all, have e\ olved around 
such natural sources 

The usual and most effective way of dealing with such complex 
"stochastic” sources as speech is to use statistical (average) methods 
and measures The method we have adopted, for studying binaural 
fusion, is of the type illustrated by Figs 1 and 3, our original paper 
(Sayers and Cherry, 1957) contains many more 

It is from the parameters of such statistical data that the parameters 
of the binaural fusion mechanism may be inferred But such param- 
eters must not be expected necessarily to remam mvanant as the type 
of signal is changed (or the noise of the environment or other condi- 
tions). This variation we have found to be the case, in certain ways 
But it is not the fusion mechanism that seems to change, rather it is 
extremely flexible in the way it handles limitless variety of binaural 
signals — speech, with its transients, its quasi random breath noises, 
its quasi periodic formants as well as all the myriad of other sound 
sources that surround us m daily life Rather it is the data that the 
fusion mechanism recognizes as being “in common” between the two 
ears that are found to vary according to circumstances, if one class of 
data is not available in the stimuli, the mechanism finds another 

This land of flexibility is, of course, indicative also of correlation 
processes being used by the brain Before examining for the exact 
processes, let us illustrate this flexibility m the case of speech. 


The data in speech controlling the binaural fusion process 

In this series of experiments, intoned vowels were used (to remove 
transients) They were recorded on tape, which was subsequently 
closed into a loop to provide a continuous source 

men such an intoned vowel is presented bmaurally to a subject, 
the resulting "judgment curve" is exactly the same as that of Fig la 
for a random or quasi random source (or replottcd so as to eliminate 
arbitrariness of zero delay T, as in F,g Ic) This is indicative of the 
fact that the random breath sounds, closely similar in both cars, con- 
trol the fusion process, the formant periodicities are not apparent 

'but no can easily destroy the similarity of the breath sounds arriv- 
mg a ^rear by adding <0 one side a white noise “ “ 
Fig 2. What docs the mechanism fuse on now? F surcl e s lmvv s 
one example, with an average signal-.o no.se ratio of -0 decibels. 
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the difference now is that dips have appeared at values of T e that 
indicate that fusion is taking place on the periodic components 
(formants) of the intoned vowel source (which have known fre- 
quencies) Thus the fusion mechanism is operated by common breath 
sounds, under quiet conditions, but partly by the common formants 
when breath tones are rendered useless. 



T, (msec) 
(«> 



Fig. 4. Judgment cur.cs for (a) t he intoned vowel a, noise-masked in the left 
ear only, with signal- to- noise ratio -9 db (theoretical curve dashed), (b) two 
sinusoids (COO and £00 cps. equal level) 

We shall not repeat details of the calculations here but merely state 
that the correspondence between the dips in the curve of Fig. 4a and 
the formant frequencies of the intoned vowel has been examined by 
Sajcrs (Sa>ers and Cherry, 1957), his calculated “judgment curve" 
is shown dashed on Fig. 4a Note the coincidence of the peaks Such 
a calculation, based on the mathematical model to be described m the 
next section, is possible with some accuracy because the formant fre- 
quency and decrements can be measured accurately in this case of 
a steadily intoned vowcL 
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The asymmetry of the curve of Fig 4a representing our "precedence 
effect” should also be noted 


The Model of the Binaural Fusion Process 


The few curves we hn\e illustrated here (Figs 1, 3, 4) are typical 
of a very large number resulting from measurement with many dif- 
ferent types of sound source and different listening subjects (Sayers 
and Cherry, 1957) 

Let us look now at the steps mvolved in building up a theoretical 
model, by which such fusion "judgment curves” may be calculated 
for all sorts of sound sources — pure tones, chords, intoned vowels, 
noise, running speech. 


Tho First “simple model" of binaural fusion 

The simplest interaural correlation process is illustrated by Fig 5 
Briefly, we argue that the L and R signals, Sx.(f) and S*(£), are 
cross correlated and that the correlation function 7? j 2 C ^ T ) 15 repre 
sented upon a “conceptual surface,” as shown by the dashed curve 
Then the “judgment mechanism" decides whether the function lies, 
on an average, more to the left or more to the right of the mid- ("intra- 
cranial”) line The measure used does not appear to be critical, since 
only a dichotomous judgment is needed, for our calculations, we 
have used the normalized areas lying, left and right, under the curve 
R 15 >(f, t) One additional operation is necessary at this stage, before 
cross correlation, the signals Sr(t) and S„(<) need to be combined 
with then- own mean values /U and A„ (the method of combination 
used tentatively at this stage is simple linear addition) The reason 
for this is that these mean intensities can themselves also control the 
It L judgment and so must appear upon the conceptual surface 
This oversimplified operation is clarified in our final model, as de- 
scribed m a later section , t. 

Conventional cross-correlation is represented by the function 


4>i2 (r ) 


* r±". ST J_. T 


m ?(<+')* 


a) 


but clearly tins is not usable, since the aural processes arc proceed 
«ig t rn^auif time, and intcgratioi^^cr ^e fumrc^tf^mrcahshc.^^gain, 
the brain cannot integrate ■ £ Muircs of sound m oie about, 

mails are made moment by , ro „„ ing concision. 

Clearly . the process can only be one of short term s. 
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Judgment of intracranial position 
Judgment mechanism 


t 

Time averaging 

t 





\ t 


Transformation Transformation 

t t 

Le't-SKle input SJO R ght-sde input S^t) 

IJg. }. Model of the simple cross-correlation theory of binaural fusion. Note 
that the running cross-correlation unit has been divided in two parts only for 
clarity of representation. 


represented most generally by 

Hull, T> = J* J{ 1 - TMl - T - ,)ffm dT (2) 

where W(T) is a weighting function, corresponding to a short-term 
memory The exact form of this function is not critical, we have 
found, but it is convenient to assume that it is exponential. 


W{T) 


H, 


( 3 ) 
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Lickhder (1951) used a similar running correlation function in Jus 
model of pitch perception We have estimated the time constant K 
here as approximately 6 msec This running cross correlation is illus- 
trated by Fig 6 


Now 



rig 6 Short term running correlation, as from Eq 2. 


Theoretical results, based upon the " simple model" 

The key to the exact form of the correlation process was first ob 
tamed from fusion measurements based upon single and multiple 
pure tones (see, for example. Figs lb and 4b) Such curves appeared 
to be closely similar to the correlation functions of the signals them 
selves (for such steady state signals no short term memory t~ KT was 
involved) 

Notice one important thing about these curves, typical of all our 
measurements, they are “sawn -off* sharply at the 100-pcr-cent and 
0 per-cent levels, corresponding to the listeners absolute certainly 
of L or R \Ve shall cite only one example here Fig 4b * shows 
(dashed) the calculated curve based upon the “simple model* when 
600 and 800 cps are applied together bimurall) This docs not, of 
course, recognize die existence of these 100 per cent and 0 per-cent 
limits, since the model contains no “thresholds of absolute confidence 


The complete model of binaural fusion 

The “simple model' of Fig 5 was found to be inadequate for scicral 
reasons Predominant is the fact that the L-R lateral ino.emct of 
a fused sound image is controlled also liy the rcWive 
the two ear signals rU and .t s , as well as by tlitir timing in relation 
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to one another Wo have made many measurements c£ fusion ‘judg- 
ment cuncs," using, m particular, multiple smusoids and intoned 
vowels, and have found the following a most surprising result. 

Briefly, if the form of the signals S L (t ) and S«(f) is held constant, 
but their relatiYO mean intensities AJA„ are varied, the form of the 



Fig 7. Judgment curves for two- sinusoid binaural signal, showing influence of 
interaural amplitude diSereuce (600-cps component, — Z db with respect to 
800-cps component in each channel) (a) S*,(t) above S*(t) by 10 db, {&) 
by 8 db, and (c) by ~6 db Theoiebcal curves dashed. 

judgment curve itself also remains invariant, however, thic judgment 
curve is moved bodily, up and down, between the “sawn-off* 100-per- 
cent and 0 per-cent limits Figure 7 shows one typical result, using 
two smusoids, 600 and 800 cps, as in Fig. 4 b also It is as though the 
100- and 0-per-cent limits form a mask, behind which the judgment 
curve slides up and down, as controlled by the ratio AiJAg 



TWO EARS— BUT ONE WORLD TT3 


Such a result is very sunply accounted for if the relative signal am 
phtudes An and A n at the left and nght ears axe assessed, by some 
averaging process, and used to weight the L and fl sides of the corre- 
lation function fli 2 (f,r) (see Fig 5) accordingly, before the final 
judgment process decides whether this function lies predominantly 
L or R of the mid- (intracranial) line 
Such an assessment of A L and A B might well result from an addi 
tional autocorrelation process, operating on the signals at each ear 
separately A gain, if this is short term, as for the cross-correlation 
f^i 2 (f>T) in Eq 2, running average values are assessed, At(f ) and 
A/j( i), so that these are themselves functions of time Such a method 
of assessment provides, inherently, a theoretical basis for other binaural 
phenomena, to be referred to later 
The complete model upon wluch calculations have been based is 
now shown in Fig 8 and is described m detail in our 1957 paper 
Briefly, the signals at each ear Sz(t) and S B (t) are first subjected to 
separate running autocorrelation, and the two resulting functions are 


then cross-correlated 

This cross correlation function Rn(t t) is subsequently represented 
on a "conceptual surface” but, before the "judgment mechanism de 
cides whether this function lies predominantly L or R of the midline, 
the two parts of the function (L and R) are weighted by a long term 
time average of the two autocorrelation functions, Rii, (/, r) and 
Rn R (t, T ) There are, therefore, three "conceptual surfaces involved, 
one at each ear and one in the center , 

Such a model is, of course not neurophysiological, but functional. 
it has been used for calculating L R hteial 'judgment curves, with 
many forms of binaural signal, cither identical at each oar or none 
masked at one ear We cannot cite rainy here (other than Fig , 
-16, and 7) but would refer the interested render to the original (Sayers 


and Cherry, 1957) 


Some Further Aural Effects Predicted by the Model 

1 When the left or nght ear on*. S^y’the 

centrally fused subjectrve image is formed In me m ■ ' 

L or R 'conceptual surface,' and not the 

Similarly, when the L and R signals are horn statist, caHy independent 
sources the same result obtains (Cherry, , n v t,as no f re 
2- When a complex source of; ’SS cp* it is an 
quency components below approxim y signals fuse 

experimental fact that only the envelopes of the L and H s,g 
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into a binaural image, their microstructures do not operate the fusion 
mechanism (Leakey, Sayers, and Cherry, 1958). Dut when the signals 



Running 1 

I 


autocorrelation 1 


autocorrelation 


Left-sde input S L (t) 


R ght-side Input S^fO 


"o 8 * Model including the proposed extensions to the basic theory, from which 
calculations have been made 


axe identical (or different) pure tones above about 1500 cps, with no 
varying envelopes, no fused image forms This phenomenon assists 
our directional hearing of signals, all of whose components of wave 
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length are shorter than twice the distance between the ears Our 
fusion model does the same, smce, under these conditions, the running 
autocorrelation processes at each ear automatically pass on only the 
•envelopes (running averages) to the central cross correlation process 

3 When listening bmaurally to multiple sinusoids, the subject can, 
at will, listen to the whole fused chord, or to any one, or any group, of 
fused tones as distinct Gcstalten Again, by independent time delay 
controls, each of these different fused images can, subjectively, have 
independent lateral movement Our model also enables us to calcu- 
late the “judgment curves” of lateral image positions for each of the 
images separately (Sayers and Cherry, 1957), smce the preliminary 
autocorrelation processes perform the necessary periodicity discrim- 
inations 

4 If, with binaural stimulation, one ear is, in addition,^ mashed by 
noise, or if other perturbation is introduced, a kind of precedence 
effect” arises which we noted m an earlier section Voclchcr has 
shown that tins can be explained only in terms of the short term 
auto- and cross correlation processes used m the model, but lus work 
is not yet published 


The relations between our model and the proposals made by Lich- 
hder (1956) concerning pitch perception might be noted here Vc 
have adopted the same running-correlation function as Inchlider and, 
like him, we require both auto and cross correlation (although the 
order rs reversed in our model) But our preimscs and experimental 
data are utterly ddferent m kind Licklider is concerned ' 
physiological facts, we are not Again, it is on pi c percep 
concentrates, rather than upon the various fusion P^nornona Wc 
have been concerned solely with bedding a functional model of fusion 
processes, that rs, with a set of mathematical operations whereby 
we can calculate numerically the probabdi y o 

overstressed that models such as 
as “one of a Cass of models". 

matical processes cited and recast > . processes 

alternative processes Thus “ v "°, " L‘ interpreted by Sltcnng 
to correlations, correlations can w „„„mnh\£iolocv that 

processes instead And, once again, - j- ££2* T&Si 
concerns us here, but representations of behav ,or-entiru> 
v lew" of the human black box. enaction of datutiml 

A great many perception phenome ^ CaUllcn arc rccog- 

invariants from among the raw sense 
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mzed under many forms of transformation and perturbation— both 
determinate and statistical, for example, noise Again, the percep- 
tions operate in real time, moment by moment. 

Short term correlation thus seems to be a logical choice for .the 
extraction of such invariants We end then with a somewhat vague 
suggestion that perhaps the various hierarchical processes of percep- 
tion may be described by hierarchical successions of such correlations 
The question is whether short term correlations may not be the bricks 
of which part of the house of perception is built 


Summary 

In this paper a mathematical model is developed which is desenp 
hve of the binaural fusion process Fusion of all types of signal can 
be han dled sinusoids, noise sounds, intoned vowels, ru nn i n g speech, 
and so forth. 

It is entirely stimulus response behavior that is described by the 
model, and no reference whatever is made to the physiology or 
anatomy of hearing. With the model calculations can be made, m 
complete detail, of the probability with which a human listener will 
hear sounds as lying to the right or left side of his intracranial plane, 
when binaural stimuli are presented with some mutual tune delay 
The two signals (left-ear, nght ear) need not be identical, fusion still 
occurs when they are different, as in fact they always are in real life 
Again, the mathematical model will assess the probability of fusion. 
Only if the two signals are totally unlike (statistically independent) 
are we assured of total failure of fusion. 

The details of the model have already been published elsewhere 
In the present paper attention is paid particularly to the arguments 
underlying the development of the model Its relation to models of 
other hearing processes is clarified. 

Finally, some typical examples are cited, comparing the experi- 
mental and the calculated results, and some general implications of 
the model for binaural perception are drawn. 
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^ripheral Coding 
of Auditory Information 


Neurophysiological Specifications for the Form 
of the Auditory Code 


The most strategic spot at which to examine and assess the informa- 
tion that enters the central nervous system through its sensory input 
systems is at the level of the primary afferent neuron In some s> stems, 
such as the cutaneous and the muscular, the afferent neurons are 
grouped into many separate afferent nerve bundles, but in others, 
notably the visual, olfactory, vestibular, and auditory s> stems, a single 
pair of cranial nerves carries the entire sensory input of that par- 
ticular sense modality In the auditory nerve it is possible to record 
the action potentials of peripheral neurons that have not yet made 
synaptic connections It is particularly appropriate to use a “ d ‘ t0I > 
system as an example o£ neurological coding ^cause 
theory, which is obviously the background for our thmkmg 
problem, has developed primarily around au ory j j or 

Perhaps the auditory system can serve as a model in dm regard 

of myelinated nerve fibers of q “ n ,| ll . r a|I y die spiral 

microns in diameter) The ceU bod ^ ^ ^ bipollr „, 1ISi 
gangbon ms.de the cochlea ( ^ L gv ngbon are recognized (There 
and no synaptic connechons ■“ “' s 8 oItrocod Jur bundle or 

is also a relatively sn.aU efferent i bun , ^ ipsllatera j an j ,hc 

tracts of Rasmussen, with cell bo ^ n0 , k ‘ 10W lU function, 

contralateral olivary complex We y lola l uumbcr of 

and I shall disregard it deliberate* ) “ MOT i a.OCO and 

afferent Hbers, based on ganglion b number of 

30,000 m cacli car The number is about die same 
119 
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sensory cells (hair cells), and the distribution of both &anghon ceUs 
and sensory cells along the length of the basilar membrane of the 
cochlea is approximately, although not absolutely, uniform (Davis, 
1937, 1959, Stevens and Davis, 1938) 

The relation of one set of sensory cells, the internal hair cells, to 
nerve supply is fairly simple As a Erst approximation, we may UunK 



Fig. i. Mid modiolar section of the cochlea of a guinea pig The cochlear 
canal is cut across nine tiroes. The auditory ner\ e passes down the center of the 
modiolus and through the internal auditory meatus to the brain. The cell bodies 
he in the spiral ganglion between the nerve proper and the organ of Corti on 
the cochlear partition. In this particular animal the nerve cells and the organ 
of Corti. of the first turn had degenerated, following exposure to a high intensity 
tone of high frequency 

of two or three internal hair cells as connecting to each fiber, and of 
each hair cell as receiving innervation from two or three nerve fibers 
(Fig. 3) A similar relation apparently holds for one class of fibers, 
the radial fibers, to the external hair cells, but each of the more numer- 
ous external spiral fibers innervates many cells, and each cell receives 
innervation from a number of fibers (Fernandez, 1951, Stevens and 
Davis, 1933) 

In the overlapping innervation of hair cells, we find what may be 
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analogous to the penpheral nerve net of the cntaneons system and 
to the extrafoveaf reg.ons of the retina, where a considerable amount 
°, P ° f ““elation seems to be the rule The overlap may be 
simply part of Datures margin of safety or redundancy in the struc 
ture of the nervous system (I tell my students, “No single neuron 



fig 2 Camera htcith drawing of a cross section of lie cochlear partition In tho 
second him of the cochlea of a guinea pig The bipolar nerve cells of tho spiral 
ganghon appear at the lower left The nerve fibers have no m>cbn sheath In 
the organ of Cortf. Tho sensory cells arc die interna] and external hair cells. 

unless it be an anterior bora cell, is ever completely indispensable ”) 
The overlap of innervation also provides at least the anatomical possi 
bihty of some interaction among sensory cells prior to the setting up 
of the familiar all or none impulses of medullatcd nerve fibers The 
penpheral endings of the sensory nerve fibers are nonmcdullated 
throughout the organ of Corti They arc completely naked, without 
even a Schwann cell sheath They should be regarded as a dendntic 
system. Conduction by electrotonus only or by decremental conduc- 
tion is common w dendntic systems We have no nght to assume 
that all-or none impulses followed by refractory penods anse until the 
auditory fibers acquire their myelin sheaths in the habenula perforata. 

In the auditory nerve where it passes through die internal auditory 
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meatus, however, we find all or none impulses, separated in a given 
fiber by tune intervals of at least 1 millisecond and usually somewhat 
longer (Tasaki, 1934) This form of conduction, typical of peripheral 
nerves and of white matter in the central nervous system, constitutes 
a very severe limitation for the coding of incoming sensory information 



fig 3 Plan of innervation of the hair cells, according to Lorente de N6 (in 
Wever, 1949) The intraganghomc spiral bundle is composed of efferent fibers. 
It arises in the medulla and does not have its cell bodies in the spiral ganglion 


The dominant problem of auditory theory for many years has been 
to determine how the physical dimensions of the monaural acoustic 
stimulus, namely, frequency and intensity, are coded in the form of 
all or none impulses, separated by silent intervals and travebng in a 
large number of parallel, insulated channels A second problem, 
which has attracted more and more attenbon in recent years, has been 
the coding of the information that forms the basis of sensing the direc 
bon from which sound arrives Binaural differences in time become 
very important in this problem. 


The Classical Rules and Possible Mechanisms 
for Coding Auditory Information 

From the work of Bekesy and many others, we know that the 
coclilea acts as a mechanical acousbcal analyzer, and that the position 
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of maximum mechanical activity along the basilar membrane changes 
as a function of frequency (Bekesy and Rosenbhth, 1951, Davis, 
1958a) It seems inescapable that an important part of the mforraa 
tion concerning the frequency of the incoming sound waves partial 
Iarly those of high frequency, is carried by the built m assignment 
of certain channels (nerve fibers) to certain parts of the basilar mem 
brane This arrangement takes advantage of the differential response 
to frequency of different sections of the cochlea Tins is the place 
principle of coding of frequency (pitch) information 
It is also well established that for lower frequencies (below 2000 
cycles per second or thereabouts) nerve impulses m the auditory 
nerve tend to group into volleys because each sound wave acts as a 
separate stimulus Volleys of nerve impulses following one another 
at the frequency of the original sound waves have been observed in 
the pattern of nerve impulses as they pass up the auditory nerve 
(Bekesy, 19596, B^kdsy and Rosenbhth, 1951) It has been abun 
dantly proved m psychophysical experiments that information con 
ceming the frequency of a low tone (its pitch) can be transmitted by 
nerve fibers other than those that are excited by a pure tone of that 
frequency A high frequency noise that is periodical!) interrupted 
causes us to hear a low frequency pitch — its 'periodicity pitch* — 
in addition to its high frequency spectral pitch. This is the so 
called frequency or volley principle of auditory coding (Davis, Sd- 
verman, and McAuhffe, 1951, Licklider, 1951, Miller and Taylor, 
19-18) 

In some sensory systems the frequency of discharge of impulses m 
each afferent fiber is an important method of coding information con 
ceming the intensity of the stimulus This form of coding seems to 
bo of little significance in die auditory system Lx penmen tally it 
appears that the dynamic range over which the frequency of discharge 
in a given fiber is also a function of intensity is very limited not more 
dun 20 or 25 decibels ( Galambos and Davis, 19-J3, katsuU et al , 
1958) Very likely the information concerning die mtcnsitv of an 
acoustic stimulus is more dirccdy represented by die total number 
of fibers that are active than by die total number of nerve impulses 
In a given length of tuna 

A dnrd method of convey mg (coding) intensity information is by 
a sy stematicaliy graded set of thresholds among the receptor units 
and a correspondingly graded contribution per fiber to the ultimate 
sense of loudness In the car Uicrc are two dimensions of systematic 
grading both of them probably significant Tint die different rows 
of sensorv cells probibly have different thresholds and perhaps dif* 



122 HAliOWELL DAVIS 

meatus however, we find all-or none impulses, separated in a given 
fiber by tune intervals of at least 1 millisecond and usually somewhat 
longer (Tasaki, 1934) This form of conduction, typical of peripheral 
nerves and of wlute matter in the central nervous system, constitutes 
i very severe limitation for the coding of incoming sensory information 



Fig 3 Plan of innervation of the hair cells according to Lorente de N6 (m 
Wever 1949) The intragangliomc spiral bundle is composed of efferent fibers 
It arises m the medulla and does not have its cell bodies in the spiral ganglion 


The dominant problem of auditory theory for many years has been 
to determine how the physical dimensions of the monaural acoustic 
stimulus, namely, frequency and intensity, are coded in the form of 
all-or none impulses, separated by silent intervals and traveling in a 
large number of parallel insulated channels A second problem, 
which has attracted more and more attention m recent years has been 
the coding of the information that forms the basis of sensing the chrec 
tion from which sound arrives Binaural differences in time become 
very important in this problem. 


The Classical Rules and Possible Mechanisms 
for Coding Auditory Information 


From the work of Bekesy and many others, we know that the 
codilea acts as a mechanical acoustical analyzer, and that the position 
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of maximum mechanical activity along the basilar membrane changes 
as a function of frequency (Behesy and Roseubldh, 1951, Davis, 
1958a) It seems inescapable that an important part of the informa- 
tion concerning the frequency of the incoming sound waves, particu- 
larly those of high frequency, is carried by the built in assignment 
of certain channels (nerve fibers) to certain parts of the basilar mem 
brane Tins arrangement takes advantage of the differential response 
to frequency of different sections of the cochlea Tins is the place 
principle of coding of frequency (pitch) information. 

It is also well established that for lower frequencies (below 2000 
cycles per second or thereabouts) nerve impulses in the auditory 
nerve tend to group into volleys because each sound ware acts as a 
separate stimulus Volleys of nerve impulses following one another 
at the frequency of the original sound waves have been observed m 
the pattern of nerve impulses as they pass up the auditory nerve 
(Behesy, 1959b, Bekesy and Rosenbhth, 1951) It has been abun 
dantly proved in psychophysical experiments that information con- 
cerning the frequency of a low tone (its pitch) can be transmitted by 
nerve fibers other than those that are excited by a pure tone of that 
frequency A high frequency noise that is periodically interrupted 
causes us to hear a low frequency pitch — its "periodicity pitch" — 
m addition to its high frequency spectral pitch. This is the so- 
called frequency or volley pnnctplc of auditory coding (Davis, Sil- 
verman, and McAuhffe, 1951, Lichbder, 1951, Miller and Taylor, 
1948). 

In some sensory systems the frequency of discliarge of impulses in 
each afferent fiber is an important method of coding information con- 
cerning tlie intensity of the stimulus Tins form of coding seems to 
be of little significance in die auditory system experimentally it 
appears that the dynamic range over winch the frequency of discharge 
in a given fiber is also a function of intensity is very limited not more 
tlian 20 or 25 decibels (Calambos and Davis, 1913, Katsuki, ct al , 
1958) Very likely the information concerning the intensity of an 
acoustic stimulus is more directly represented by the total number 
of fibers that are active than by die total number of nerve impulses 


in a given length of time. . 

A third method of comeymg (coding) mluisUy information is by 
a systematically graded set of thresholds among the receptor .mils 
and a correspondingly graded contribution per fiber to the nit, mate 
sense of loudness In the ear there arc too dimensions of systematic 
grading, boll, of them probably significant- F.rst, the different tons 
of sensory cells probably base different thresholds and perhaps d,f- 
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ferent importance centrally Second, for a sound of a given frequency 
there is a point of maximum sensitivity somewhere along the basilar 
membrane Receptors that are located between this point of maxi 
mum sensitivity and the basal end of the organ of Corti can also be 
stimulated by this same tone, but its intensity must be increased 
Thus there is, for every audible frequency, a s>stcmatic grading of 
thresholds lengthwise along the organ of Corti 

For the lateralization of the source of a sound, the two parameters 
of binaural difference in intensity and binaural difference in tunc of 
arrival of corresponding sound waves arc both known to be significant. 
The importance of earlier arrival (the precedence effect), even by a 
small fraction of a millisecond, is well recognized The fixed ana- 
tomical separation of the two ears with the intervening acoustic baffle 
of the head gives a physical base on which nature has built into the 
nervous system a particular code, based on differences in tune and 
differences m intensity appropriate to low and to high frequencies, 
respectively, to carry information relative to the direction of arrival 
of incoming sounds Considerable attention has been given to the 
“trading relation” between intensity and time There is a peripheral 
trading relation between intensity and time, because neural latency 
depends not only on the physical tune of arrival of a sound but also 
on its m tensity 

It is not so well recognized that there must inevitably be certain 
small scale tune differences (differences in latency) among the re 
sponses of the nerve fibers of an individual ear The traveling waves 
on the cochlear partition impose certain necessary differences in the 
tune of arrival of the crests of the waves at different points along the 
cochlea (Fig. 4) This is true for acoustic transients as well as for 
sustained tones These time differences are a function of the fre 
quency of the sound waves in question (Tasaki, Davis, and Legouix, 
1952) If it is true (and I believe that it is a good general rule) that 
the central nervous system never fails to make some use of every bit 
of information that is presented to it, the consequent systematic dif 
ferences in time of arrival of impulses in different nerve fibers are 
presumably utilized — although for what purpose we do not know 
Some other observations however, allow us to speculate concerning 
possible roles for these systematically delayed impulses and also for 
the synchronized volleys of impulses described earlier 
We do know, as a matter of experimental observation, that mdi 
vi dual elements in the midbram and cortex are more “sharply tuned” 
than are the neurons of the auditoiy nerve (Katsulu, et aL, 1958) It 
seems clear that inhibitory processes are at work and brmg about the 
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The Place of the Auditory System in 
Phylogenetic Development 

Some insight into the coding and the nature of organization of each 
sensory system may be obtained from die generalizations formulated 
by G H Bishop (1959), elaborating the concepts of Judson HerncK, 
in relation to fiber size and to phylogenetic development Bishop 
points out that the central nervous system seems to have evolved y a 
process of successive accretion In the simplest, earliest vertebrates 
the primitive system was composed of relatively small nerve fibers 
that conduct rather slowly Trom the sensory point of view the system 
was sensitive, and from the organizational point of view it was well 
integrated Responses tended to involve large parts of the body in 
rather complicated patterns of movement for orientation, escape, foo 
taking, and so forth The reticular formation of the central nervous 
system represents part of this primitive organization and so do some 
touch and pain fibers The cutaneous nerve net is probably a survival 
of this primitive form of organization 
To the original primitive nervous system, which nevertheless still 
persists m the higher and more complicated forms, has been added a 
second and m some cases a third system These new systems supple- 
ment but do not displace the old The newer systems are charac- 
terized by larger, faster fibers and by discrete innervation The latter 
feature makes possible finer discrimination, better localization, and 
“mensuration” The absolute thresholds of the newer systems may 
not be so low, but the differential thresholds are smaller 
The existence of two sets of receptors, fibers, and even centers for a 
given modality is in itself a form of codmg, but we should not try to 
interpret this situation without looking at the organism as a whole 
I suspect that the key to this duality may be an ability on the part of 
the organism to accept or reject the information from one or the other 
of these systems as a whole, for example, by what we call “paying 
attention ” or perhaps to use one set of information for one purpose 
and another set for another purpose or set of purposes 

In the special case of the auditory system, the concept of old versus 
new does not turn out to be immediately or obviously helpful As 
nearly as we can tell, the cochlear system is a relatively late develop- 
ment in the evolutionary sequence It is certainly quite primitive in 
birds and reptiles and is fully developed only in the mammals The 
cochlear fibers are considerably smaller than the large fibers of some 
parts of the (muscular) proprioceptive system The labyrinthine 
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action potentials of the neck, muscles, respiratory muscles, and so forth, 
of the animal, and (2) asynchrony of discharge of the volley of im- 
pulses The asynchrony is due pirtly to the Bck£sy traveling wave 
(Davis, 1958a) when the frequency is low, and partly to systematic 
variations in latency with the intensity of stimulation This is sig- 
nificant because some fibers in each volley are always near their 
threshold of response There are also spontaneous fluctuations in 
latency, and in the guinea pig, at least, there is systematically a greater 
length of fiber from the apical than from the basal region 


CM and SP CM and SP 



AP AP 

M lb) 


Fig 5 Cochlear microphoruc (CM) and summating potential (SP) recorded 
by mtracochlear electrodes from the basal turn of a guinea pig. The stimulus 
was a 7000-cycle tone burst with a 1 msec nse time The stimulus for the oscillo- 
gram m (a) was just maximal for CM for the oscillogram m (b) it was 10 db 
stronger Note that the CM is smaller m (6), whereas the SP (the d-c displace- 
ment of the upper trace) and the action potentials (downward “spikes” m the 
lower trace) are larger (From Davis, Deatherage, et aL, 1958m) 


We have concentrated on two forms of stimulus that evoke well- 
synchronized volleys of action potentials from known regions of the 
cocWea (Fig. 5) The first is the onset of a tone burst at 7000 cps 
with a nse time of 1 millisecond. (We use 1 msec nse bme m order 
to mmimize the scattering of acoustic energy to other frequencies 
The frequency, 7000 cps, is selected as bemg high enough to give 
summated stimulation, without the volley effect, that is determined 
y die shape and amplitude of the envelope and quite mdependent 
of the individual sound waves, at the same time 7000 cps is low enough 
to allow us a wide dynamic range within the limits of our transducer; 
and finally it stimulates at a point on the cochlea that is anatomically 
accessible for the placement of electrodes ) The pattern of action 
potmhal response to such a stimulus is the familiar spike (N t ) fol 

I°W T 57 SUCCCS r ely SmaUer spJces W* and N 8 ) at intervals of 
about 1 msec. This interval represents the refractory period of the 
auditory nerve fibers Incidentally, we should think of 7000 cps to 




